🔬 LLM Benchmark Report

Run ID: 20251231_130358_progress | Generated: 2025-12-31T13:18:05.353178

Models Tested

7

Total Evaluations

4040

Datasets

5

Languages

2

Avg Latency

3113ms

📋 Dataset Configuration

Her dataset için score range ve prompt özeti

Dataset Score Range Task Type Language Prompt Template Summary
persuade 1 - 6 scoring EN Rate text from 1 to 6
turkish_movie_sentiment 0 - 5 sentiment TR Classify/evaluate text
turkish_sentiment_large 0 - 2 sentiment TR Classify/evaluate text
turkish_offensive 0 - 1 classification TR Classify/evaluate text
brighter_emotion 0 - 3 emotion_intensity MULTILINGUAL Classify/evaluate text

📊 Model Performance (Advanced Metrics)

Model N MAE ↓ NMAE ↓ QWK ↑ Accuracy Pearson Spearman Bias Latency
litellm/hermes-3-405b 573 0.462 0.092 0.896 92.0% 0.892 0.861 -0.26 8966ms
litellm/llama-3.3-70b 589 0.540 0.108 0.860 92.4% 0.882 0.867 -0.30 1355ms
litellm/mistral-small 574 0.544 0.109 0.853 86.9% 0.875 0.857 -0.40 2247ms
litellm/gemma-3-27b 576 0.551 0.110 0.836 87.5% 0.840 0.817 -0.24 2999ms
litellm/openai/gpt-4o-mini 573 0.554 0.111 0.868 92.8% 0.869 0.820 -0.29 2586ms
litellm/llama-3.1-8b 579 0.632 0.126 0.801 88.9% 0.804 0.805 +0.00 1038ms
litellm/llama-3.2-3b 576 0.635 0.127 0.839 92.4% 0.831 0.798 -0.00 2676ms

📈 Strategy & Scoring Mode Performance

Her model-dataset-strategy-scoring mode kombinasyonu için detaylı metrikler

Model Dataset Strategy Scoring Range N MAE ↓ QWK ↑ Accuracy Pearson Spearman Bias
gemma-3-27b brighter_emotion few_shot 📊 cont 0-3 24 0.583 0.467 83.3% 0.621 0.586 +0.14
gemma-3-27b brighter_emotion few_shot 🔢 disc 0-3 24 0.625 0.391 87.5% 0.429 0.393 +0.12
gemma-3-27b brighter_emotion rubric_based 📊 cont 0-3 24 0.577 0.467 83.3% 0.628 0.615 +0.14
gemma-3-27b brighter_emotion rubric_based 🔢 disc 0-3 24 0.583 0.482 91.7% 0.520 0.487 +0.08
gemma-3-27b brighter_emotion zero_shot 📊 cont 0-3 24 0.575 0.559 79.2% 0.658 0.601 +0.13
gemma-3-27b brighter_emotion zero_shot 🔢 disc 0-3 24 0.625 0.391 87.5% 0.429 0.393 +0.12
gemma-3-27b persuade few_shot 📊 cont 1-6 3 1.500 0.000 0.0% 0.000 0.000 -1.50
gemma-3-27b persuade few_shot 🔢 disc 1-6 3 0.000 0.000 100.0% 0.000 0.000 0.00
gemma-3-27b persuade rubric_based 📊 cont 1-6 3 1.900 0.000 0.0% 0.000 0.000 -1.90
gemma-3-27b persuade rubric_based 🔢 disc 1-6 3 2.000 0.000 0.0% 0.000 0.000 -2.00
gemma-3-27b persuade zero_shot 📊 cont 1-6 3 1.383 0.000 0.0% 0.000 0.000 -1.38
gemma-3-27b persuade zero_shot 🔢 disc 1-6 3 1.000 0.000 100.0% 0.000 0.000 -1.00
gemma-3-27b turkish_movie_sentiment few_shot 📊 cont 0-5 18 1.003 0.717 55.6% 0.689 0.415 -0.61
gemma-3-27b turkish_movie_sentiment few_shot 🔢 disc 0-5 21 0.929 0.569 85.7% 0.682 0.502 -0.64
gemma-3-27b turkish_movie_sentiment rubric_based 📊 cont 0-5 18 0.989 0.717 61.1% 0.704 0.427 -0.63
gemma-3-27b turkish_movie_sentiment rubric_based 🔢 disc 0-5 18 1.083 0.556 83.3% 0.693 0.563 -0.75
gemma-3-27b turkish_movie_sentiment zero_shot 📊 cont 0-5 21 0.871 0.720 66.7% 0.679 0.363 -0.49
gemma-3-27b turkish_movie_sentiment zero_shot 🔢 disc 0-5 21 0.881 0.620 90.5% 0.721 0.573 -0.60
gemma-3-27b turkish_offensive few_shot 📊 cont 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gemma-3-27b turkish_offensive few_shot 🔢 disc 0-1 27 0.296 0.419 100.0% 0.438 0.438 -0.15
gemma-3-27b turkish_offensive rubric_based 📊 cont 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gemma-3-27b turkish_offensive rubric_based 🔢 disc 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gemma-3-27b turkish_offensive zero_shot 📊 cont 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gemma-3-27b turkish_offensive zero_shot 🔢 disc 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gemma-3-27b turkish_sentiment_large few_shot 📊 cont 0-2 21 0.402 0.650 85.7% 0.707 0.637 -0.40
gemma-3-27b turkish_sentiment_large few_shot 🔢 disc 0-2 24 0.250 0.667 100.0% 0.817 0.800 -0.25
gemma-3-27b turkish_sentiment_large rubric_based 📊 cont 0-2 21 0.400 0.650 85.7% 0.707 0.638 -0.40
gemma-3-27b turkish_sentiment_large rubric_based 🔢 disc 0-2 21 0.286 0.650 100.0% 0.806 0.794 -0.29
gemma-3-27b turkish_sentiment_large zero_shot 📊 cont 0-2 24 0.367 0.667 87.5% 0.721 0.645 -0.37
gemma-3-27b turkish_sentiment_large zero_shot 🔢 disc 0-2 24 0.250 0.667 100.0% 0.817 0.800 -0.25
hermes-3-405b brighter_emotion few_shot 📊 cont 0-3 24 0.469 0.795 87.5% 0.814 0.860 -0.04
hermes-3-405b brighter_emotion few_shot 🔢 disc 0-3 24 0.500 0.743 100.0% 0.746 0.762 +0.08
hermes-3-405b brighter_emotion rubric_based 📊 cont 0-3 21 0.600 0.657 81.0% 0.745 0.619 +0.02
hermes-3-405b brighter_emotion rubric_based 🔢 disc 0-3 24 0.375 0.806 100.0% 0.813 0.859 +0.12
hermes-3-405b brighter_emotion zero_shot 📊 cont 0-3 24 0.525 0.726 83.3% 0.758 0.790 +0.03
hermes-3-405b brighter_emotion zero_shot 🔢 disc 0-3 24 0.375 0.806 100.0% 0.813 0.859 +0.12
hermes-3-405b persuade few_shot 📊 cont 1-6 3 0.500 0.000 100.0% 0.000 0.000 +0.50
hermes-3-405b persuade few_shot 🔢 disc 1-6 3 0.000 0.000 100.0% 0.000 0.000 0.00
hermes-3-405b persuade rubric_based 📊 cont 1-6 3 1.667 0.000 0.0% 0.000 0.000 -1.67
hermes-3-405b persuade rubric_based 🔢 disc 1-6 3 2.000 0.000 0.0% 0.000 0.000 -2.00
hermes-3-405b persuade zero_shot 📊 cont 1-6 3 0.500 0.000 100.0% 0.000 0.000 +0.50
hermes-3-405b persuade zero_shot 🔢 disc 1-6 3 0.000 0.000 100.0% 0.000 0.000 0.00
hermes-3-405b turkish_movie_sentiment few_shot 📊 cont 0-5 18 0.742 0.717 83.3% 0.853 0.781 -0.57
hermes-3-405b turkish_movie_sentiment few_shot 🔢 disc 0-5 21 0.786 0.724 100.0% 0.801 0.749 -0.50
hermes-3-405b turkish_movie_sentiment rubric_based 📊 cont 0-5 18 0.744 0.717 83.3% 0.851 0.771 -0.58
hermes-3-405b turkish_movie_sentiment rubric_based 🔢 disc 0-5 18 0.917 0.717 100.0% 0.809 0.783 -0.58
hermes-3-405b turkish_movie_sentiment zero_shot 📊 cont 0-5 21 0.686 0.724 81.0% 0.804 0.648 -0.45
hermes-3-405b turkish_movie_sentiment zero_shot 🔢 disc 0-5 21 0.881 0.696 95.2% 0.776 0.678 -0.50
hermes-3-405b turkish_offensive few_shot 📊 cont 0-1 27 0.222 0.571 100.0% 0.632 0.632 -0.22
hermes-3-405b turkish_offensive few_shot 🔢 disc 0-1 27 0.222 0.571 100.0% 0.632 0.632 -0.22
hermes-3-405b turkish_offensive rubric_based 📊 cont 0-1 27 0.222 0.571 100.0% 0.632 0.632 -0.22
hermes-3-405b turkish_offensive rubric_based 🔢 disc 0-1 27 0.259 0.496 100.0% 0.532 0.532 -0.19
hermes-3-405b turkish_offensive zero_shot 📊 cont 0-1 27 0.259 0.496 100.0% 0.532 0.532 -0.19
hermes-3-405b turkish_offensive zero_shot 🔢 disc 0-1 27 0.222 0.571 100.0% 0.632 0.632 -0.22
hermes-3-405b turkish_sentiment_large few_shot 📊 cont 0-2 21 0.402 0.540 85.7% 0.658 0.631 -0.40
hermes-3-405b turkish_sentiment_large few_shot 🔢 disc 0-2 24 0.375 0.412 87.5% 0.590 0.655 -0.38
hermes-3-405b turkish_sentiment_large rubric_based 📊 cont 0-2 21 0.400 0.540 85.7% 0.651 0.619 -0.40
hermes-3-405b turkish_sentiment_large rubric_based 🔢 disc 0-2 21 0.429 0.386 85.7% 0.564 0.617 -0.43
hermes-3-405b turkish_sentiment_large zero_shot 📊 cont 0-2 24 0.404 0.478 87.5% 0.650 0.621 -0.40
hermes-3-405b turkish_sentiment_large zero_shot 🔢 disc 0-2 24 0.375 0.412 87.5% 0.590 0.655 -0.38
llama-3.1-8b brighter_emotion few_shot 📊 cont 0-3 24 0.640 0.490 87.5% 0.712 0.731 -0.10
llama-3.1-8b brighter_emotion few_shot 🔢 disc 0-3 24 0.625 0.627 91.7% 0.630 0.710 +0.04
llama-3.1-8b brighter_emotion rubric_based 📊 cont 0-3 24 0.673 0.577 79.2% 0.685 0.727 -0.08
llama-3.1-8b brighter_emotion rubric_based 🔢 disc 0-3 24 0.708 0.495 87.5% 0.517 0.610 +0.29
llama-3.1-8b brighter_emotion zero_shot 📊 cont 0-3 24 0.675 0.497 79.2% 0.644 0.613 -0.15
llama-3.1-8b brighter_emotion zero_shot 🔢 disc 0-3 24 0.625 0.627 91.7% 0.630 0.710 +0.04
llama-3.1-8b persuade few_shot 📊 cont 1-6 3 0.700 0.000 66.7% 0.000 0.000 -0.43
llama-3.1-8b persuade few_shot 🔢 disc 1-6 3 0.333 0.000 100.0% 0.000 0.000 +0.33
llama-3.1-8b persuade rubric_based 📊 cont 1-6 3 1.500 0.000 0.0% 0.000 0.000 -1.50
llama-3.1-8b persuade rubric_based 🔢 disc 1-6 3 2.000 0.000 0.0% 0.000 0.000 -2.00
llama-3.1-8b persuade zero_shot 📊 cont 1-6 3 0.733 0.000 66.7% 0.000 0.000 -0.27
llama-3.1-8b persuade zero_shot 🔢 disc 1-6 3 1.000 0.000 100.0% 0.000 0.000 +1.00
llama-3.1-8b turkish_movie_sentiment few_shot 📊 cont 0-5 20 1.042 0.208 65.0% 0.259 0.222 -0.03
llama-3.1-8b turkish_movie_sentiment few_shot 🔢 disc 0-5 21 0.976 0.267 85.7% 0.352 0.356 +0.60
llama-3.1-8b turkish_movie_sentiment rubric_based 📊 cont 0-5 18 1.150 -0.058 55.6% 0.025 -0.124 +0.32
llama-3.1-8b turkish_movie_sentiment rubric_based 🔢 disc 0-5 18 1.028 0.240 83.3% 0.303 0.283 +0.25
llama-3.1-8b turkish_movie_sentiment zero_shot 📊 cont 0-5 21 1.077 0.241 52.4% 0.301 0.195 -0.02
llama-3.1-8b turkish_movie_sentiment zero_shot 🔢 disc 0-5 21 0.881 0.304 85.7% 0.394 0.465 +0.60
llama-3.1-8b turkish_offensive few_shot 📊 cont 0-1 27 0.481 0.017 100.0% 0.017 0.017 +0.04
llama-3.1-8b turkish_offensive few_shot 🔢 disc 0-1 27 0.556 -0.154 100.0% -0.158 -0.158 +0.11
llama-3.1-8b turkish_offensive rubric_based 📊 cont 0-1 27 0.481 0.017 100.0% 0.017 0.017 +0.04
llama-3.1-8b turkish_offensive rubric_based 🔢 disc 0-1 27 0.518 -0.068 100.0% -0.069 -0.069 +0.07
llama-3.1-8b turkish_offensive zero_shot 📊 cont 0-1 27 0.481 0.017 100.0% 0.017 0.017 +0.04
llama-3.1-8b turkish_offensive zero_shot 🔢 disc 0-1 28 0.536 -0.129 100.0% -0.133 -0.133 +0.11
llama-3.1-8b turkish_sentiment_large few_shot 📊 cont 0-2 21 0.326 0.650 100.0% 0.835 0.790 -0.33
llama-3.1-8b turkish_sentiment_large few_shot 🔢 disc 0-2 24 0.375 0.538 100.0% 0.662 0.574 -0.21
llama-3.1-8b turkish_sentiment_large rubric_based 📊 cont 0-2 21 0.336 0.650 95.2% 0.812 0.737 -0.34
llama-3.1-8b turkish_sentiment_large rubric_based 🔢 disc 0-2 21 0.381 0.582 100.0% 0.742 0.693 -0.29
llama-3.1-8b turkish_sentiment_large zero_shot 📊 cont 0-2 24 0.321 0.667 100.0% 0.853 0.757 -0.32
llama-3.1-8b turkish_sentiment_large zero_shot 🔢 disc 0-2 24 0.375 0.436 95.8% 0.536 0.513 -0.21
llama-3.2-3b brighter_emotion few_shot 📊 cont 0-3 24 0.594 0.696 87.5% 0.735 0.761 +0.16
llama-3.2-3b brighter_emotion few_shot 🔢 disc 0-3 24 0.458 0.796 100.0% 0.817 0.845 +0.21
llama-3.2-3b brighter_emotion rubric_based 📊 cont 0-3 24 0.698 0.622 79.2% 0.579 0.621 +0.22
llama-3.2-3b brighter_emotion rubric_based 🔢 disc 0-3 24 0.750 0.635 91.7% 0.656 0.702 +0.17
llama-3.2-3b brighter_emotion zero_shot 📊 cont 0-3 24 0.688 0.576 79.2% 0.680 0.711 +0.29
llama-3.2-3b brighter_emotion zero_shot 🔢 disc 0-3 24 0.542 0.762 95.8% 0.787 0.804 +0.04
llama-3.2-3b persuade few_shot 📊 cont 1-6 3 0.250 0.000 100.0% 0.000 0.000 +0.25
llama-3.2-3b persuade few_shot 🔢 disc 1-6 3 0.000 0.000 100.0% 0.000 0.000 0.00
llama-3.2-3b persuade rubric_based 📊 cont 1-6 3 1.167 0.000 33.3% 0.000 0.000 -0.83
llama-3.2-3b persuade rubric_based 🔢 disc 1-6 3 1.000 0.000 66.7% 0.000 0.000 -1.00
llama-3.2-3b persuade zero_shot 📊 cont 1-6 3 0.500 0.000 100.0% 0.000 0.000 +0.50
llama-3.2-3b persuade zero_shot 🔢 disc 1-6 3 0.333 0.000 100.0% 0.000 0.000 +0.33
llama-3.2-3b turkish_movie_sentiment few_shot 📊 cont 0-5 19 0.895 0.362 57.9% 0.485 0.477 -0.05
llama-3.2-3b turkish_movie_sentiment few_shot 🔢 disc 0-5 21 0.738 0.536 90.5% 0.598 0.582 +0.07
llama-3.2-3b turkish_movie_sentiment rubric_based 📊 cont 0-5 18 0.929 0.176 61.1% 0.358 0.366 +0.21
llama-3.2-3b turkish_movie_sentiment rubric_based 🔢 disc 0-5 18 0.694 0.691 100.0% 0.749 0.698 +0.03
llama-3.2-3b turkish_movie_sentiment zero_shot 📊 cont 0-5 21 0.726 0.565 76.2% 0.677 0.521 -0.05
llama-3.2-3b turkish_movie_sentiment zero_shot 🔢 disc 0-5 21 0.786 0.497 90.5% 0.562 0.533 +0.02
llama-3.2-3b turkish_offensive few_shot 📊 cont 0-1 27 0.593 -0.241 100.0% -0.254 -0.254 +0.15
llama-3.2-3b turkish_offensive few_shot 🔢 disc 0-1 27 0.667 -0.421 100.0% -0.478 -0.478 +0.22
llama-3.2-3b turkish_offensive rubric_based 📊 cont 0-1 26 0.615 -0.325 100.0% -0.344 -0.344 +0.15
llama-3.2-3b turkish_offensive rubric_based 🔢 disc 0-1 27 0.667 -0.421 100.0% -0.478 -0.478 +0.22
llama-3.2-3b turkish_offensive zero_shot 📊 cont 0-1 27 0.630 -0.330 100.0% -0.359 -0.359 +0.19
llama-3.2-3b turkish_offensive zero_shot 🔢 disc 0-1 27 0.556 -0.154 100.0% -0.158 -0.158 +0.11
llama-3.2-3b turkish_sentiment_large few_shot 📊 cont 0-2 21 0.563 0.462 100.0% 0.518 0.420 -0.40
llama-3.2-3b turkish_sentiment_large few_shot 🔢 disc 0-2 24 0.625 0.306 95.8% 0.542 0.532 -0.62
llama-3.2-3b turkish_sentiment_large rubric_based 📊 cont 0-2 21 0.529 0.246 100.0% 0.343 0.171 -0.29
llama-3.2-3b turkish_sentiment_large rubric_based 🔢 disc 0-2 21 0.571 0.447 100.0% 0.722 0.699 -0.57
llama-3.2-3b turkish_sentiment_large zero_shot 📊 cont 0-2 24 0.552 0.143 100.0% 0.268 0.078 -0.34
llama-3.2-3b turkish_sentiment_large zero_shot 🔢 disc 0-2 24 0.417 0.474 100.0% 0.617 0.558 -0.33
llama-3.3-70b brighter_emotion few_shot 📊 cont 0-3 24 0.646 0.680 100.0% 0.826 0.815 -0.40
llama-3.3-70b brighter_emotion few_shot 🔢 disc 0-3 24 0.625 0.712 100.0% 0.719 0.737 -0.12
llama-3.3-70b brighter_emotion rag_rubric 🔢 disc 0-3 1 0.000 0.000 100.0% 0.000 0.000 0.00
llama-3.3-70b brighter_emotion rubric_based 📊 cont 0-3 24 0.671 0.648 100.0% 0.799 0.791 -0.37
llama-3.3-70b brighter_emotion rubric_based 🔢 disc 0-3 24 0.667 0.682 100.0% 0.692 0.711 -0.17
llama-3.3-70b brighter_emotion zero_shot 📊 cont 0-3 24 0.650 0.680 100.0% 0.823 0.815 -0.40
llama-3.3-70b brighter_emotion zero_shot 🔢 disc 0-3 24 0.667 0.682 100.0% 0.692 0.711 -0.17
llama-3.3-70b persuade few_shot 📊 cont 1-6 3 1.633 0.000 0.0% 0.000 0.000 -1.63
llama-3.3-70b persuade few_shot 🔢 disc 1-6 3 0.000 0.000 100.0% 0.000 0.000 0.00
llama-3.3-70b persuade rag_rubric 🔢 disc 1-6 2 2.000 0.000 0.0% 0.000 0.000 -2.00
llama-3.3-70b persuade rubric_based 📊 cont 1-6 3 1.900 0.000 0.0% 0.000 0.000 -1.90
llama-3.3-70b persuade rubric_based 🔢 disc 1-6 3 2.000 0.000 0.0% 0.000 0.000 -2.00
llama-3.3-70b persuade zero_shot 📊 cont 1-6 3 1.500 0.000 0.0% 0.000 0.000 -1.50
llama-3.3-70b persuade zero_shot 🔢 disc 1-6 3 0.000 0.000 100.0% 0.000 0.000 0.00
llama-3.3-70b turkish_movie_sentiment few_shot 📊 cont 0-5 21 0.929 0.639 71.4% 0.789 0.553 -0.27
llama-3.3-70b turkish_movie_sentiment few_shot 🔢 disc 0-5 21 0.976 0.675 85.7% 0.808 0.651 -0.26
llama-3.3-70b turkish_movie_sentiment rag_rubric 🔢 disc 0-5 1 1.000 0.000 100.0% 0.000 0.000 -1.00
llama-3.3-70b turkish_movie_sentiment rubric_based 📊 cont 0-5 18 0.986 0.649 61.1% 0.810 0.672 -0.49
llama-3.3-70b turkish_movie_sentiment rubric_based 🔢 disc 0-5 18 1.083 0.639 77.8% 0.779 0.608 -0.47
llama-3.3-70b turkish_movie_sentiment zero_shot 📊 cont 0-5 21 0.957 0.639 66.7% 0.789 0.573 -0.30
llama-3.3-70b turkish_movie_sentiment zero_shot 🔢 disc 0-5 21 0.881 0.709 85.7% 0.847 0.764 -0.17
llama-3.3-70b turkish_offensive few_shot 📊 cont 0-1 27 0.222 0.571 100.0% 0.632 0.632 -0.22
llama-3.3-70b turkish_offensive few_shot 🔢 disc 0-1 27 0.222 0.571 100.0% 0.632 0.632 -0.22
llama-3.3-70b turkish_offensive rag_rubric 🔢 disc 0-1 2 0.000 0.000 100.0% 0.000 0.000 0.00
llama-3.3-70b turkish_offensive rubric_based 📊 cont 0-1 27 0.259 0.496 100.0% 0.532 0.532 -0.19
llama-3.3-70b turkish_offensive rubric_based 🔢 disc 0-1 27 0.259 0.496 100.0% 0.532 0.532 -0.19
llama-3.3-70b turkish_offensive zero_shot 📊 cont 0-1 27 0.259 0.496 100.0% 0.532 0.532 -0.19
llama-3.3-70b turkish_offensive zero_shot 🔢 disc 0-1 30 0.200 0.615 100.0% 0.667 0.667 -0.20
llama-3.3-70b turkish_sentiment_large few_shot 📊 cont 0-2 21 0.376 0.650 100.0% 0.790 0.736 -0.32
llama-3.3-70b turkish_sentiment_large few_shot 🔢 disc 0-2 24 0.250 0.667 100.0% 0.817 0.800 -0.25
llama-3.3-70b turkish_sentiment_large rag_rubric 🔢 disc 0-2 1 0.000 0.000 100.0% 0.000 0.000 0.00
llama-3.3-70b turkish_sentiment_large rubric_based 📊 cont 0-2 21 0.367 0.650 100.0% 0.790 0.729 -0.31
llama-3.3-70b turkish_sentiment_large rubric_based 🔢 disc 0-2 21 0.286 0.650 100.0% 0.806 0.794 -0.29
llama-3.3-70b turkish_sentiment_large zero_shot 📊 cont 0-2 24 0.350 0.667 100.0% 0.800 0.731 -0.30
llama-3.3-70b turkish_sentiment_large zero_shot 🔢 disc 0-2 24 0.292 0.561 95.8% 0.726 0.750 -0.29
mistral-small brighter_emotion few_shot 📊 cont 0-3 24 0.512 0.719 100.0% 0.859 0.833 -0.24
mistral-small brighter_emotion few_shot 🔢 disc 0-3 24 0.375 0.854 100.0% 0.919 0.917 -0.38
mistral-small brighter_emotion rubric_based 📊 cont 0-3 22 0.532 0.712 100.0% 0.859 0.810 -0.23
mistral-small brighter_emotion rubric_based 🔢 disc 0-3 24 0.375 0.854 100.0% 0.919 0.917 -0.38
mistral-small brighter_emotion zero_shot 📊 cont 0-3 24 0.508 0.719 100.0% 0.861 0.844 -0.23
mistral-small brighter_emotion zero_shot 🔢 disc 0-3 24 0.375 0.854 100.0% 0.919 0.917 -0.38
mistral-small persuade few_shot 📊 cont 1-6 3 1.500 0.000 0.0% 0.000 0.000 -1.50
mistral-small persuade few_shot 🔢 disc 1-6 3 1.000 0.000 100.0% 0.000 0.000 -1.00
mistral-small persuade rubric_based 📊 cont 1-6 3 1.750 0.000 0.0% 0.000 0.000 -1.75
mistral-small persuade rubric_based 🔢 disc 1-6 3 2.000 0.000 0.0% 0.000 0.000 -2.00
mistral-small persuade zero_shot 📊 cont 1-6 3 1.500 0.000 0.0% 0.000 0.000 -1.50
mistral-small persuade zero_shot 🔢 disc 1-6 3 1.000 0.000 100.0% 0.000 0.000 -1.00
mistral-small turkish_movie_sentiment few_shot 📊 cont 0-5 18 1.083 0.700 33.3% 0.893 0.806 -1.08
mistral-small turkish_movie_sentiment few_shot 🔢 disc 0-5 21 0.929 0.594 85.7% 0.720 0.611 -0.50
mistral-small turkish_movie_sentiment rubric_based 📊 cont 0-5 18 1.083 0.700 33.3% 0.893 0.806 -1.08
mistral-small turkish_movie_sentiment rubric_based 🔢 disc 0-5 18 0.917 0.600 83.3% 0.789 0.717 -0.75
mistral-small turkish_movie_sentiment zero_shot 📊 cont 0-5 21 1.000 0.708 42.9% 0.829 0.685 -0.86
mistral-small turkish_movie_sentiment zero_shot 🔢 disc 0-5 21 0.929 0.594 85.7% 0.720 0.611 -0.50
mistral-small turkish_offensive few_shot 📊 cont 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
mistral-small turkish_offensive few_shot 🔢 disc 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
mistral-small turkish_offensive rubric_based 📊 cont 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
mistral-small turkish_offensive rubric_based 🔢 disc 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
mistral-small turkish_offensive zero_shot 📊 cont 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
mistral-small turkish_offensive zero_shot 🔢 disc 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
mistral-small turkish_sentiment_large few_shot 📊 cont 0-2 21 0.407 0.650 85.7% 0.725 0.656 -0.41
mistral-small turkish_sentiment_large few_shot 🔢 disc 0-2 24 0.375 0.412 87.5% 0.590 0.655 -0.38
mistral-small turkish_sentiment_large rubric_based 📊 cont 0-2 21 0.407 0.650 85.7% 0.725 0.656 -0.41
mistral-small turkish_sentiment_large rubric_based 🔢 disc 0-2 21 0.429 0.386 85.7% 0.564 0.617 -0.43
mistral-small turkish_sentiment_large zero_shot 📊 cont 0-2 24 0.375 0.667 87.5% 0.740 0.650 -0.38
mistral-small turkish_sentiment_large zero_shot 🔢 disc 0-2 24 0.375 0.412 87.5% 0.590 0.655 -0.38
gpt-4o-mini brighter_emotion few_shot 📊 cont 0-3 24 0.590 0.529 87.5% 0.772 0.692 -0.19
gpt-4o-mini brighter_emotion few_shot 🔢 disc 0-3 24 0.708 0.618 100.0% 0.635 0.595 -0.21
gpt-4o-mini brighter_emotion rubric_based 📊 cont 0-3 21 0.564 0.578 85.7% 0.804 0.677 -0.11
gpt-4o-mini brighter_emotion rubric_based 🔢 disc 0-3 24 0.708 0.618 100.0% 0.635 0.595 -0.21
gpt-4o-mini brighter_emotion zero_shot 📊 cont 0-3 24 0.602 0.529 87.5% 0.760 0.635 -0.21
gpt-4o-mini brighter_emotion zero_shot 🔢 disc 0-3 24 0.542 0.714 100.0% 0.716 0.669 -0.04
gpt-4o-mini persuade few_shot 📊 cont 1-6 3 0.000 0.000 100.0% 0.000 0.000 0.00
gpt-4o-mini persuade few_shot 🔢 disc 1-6 3 0.000 0.000 100.0% 0.000 0.000 0.00
gpt-4o-mini persuade rubric_based 📊 cont 1-6 3 1.833 0.000 0.0% 0.000 0.000 -1.83
gpt-4o-mini persuade rubric_based 🔢 disc 1-6 3 2.000 0.000 0.0% 0.000 0.000 -2.00
gpt-4o-mini persuade zero_shot 📊 cont 1-6 3 0.667 0.000 100.0% 0.000 0.000 -0.67
gpt-4o-mini persuade zero_shot 🔢 disc 1-6 3 1.000 0.000 100.0% 0.000 0.000 -1.00
gpt-4o-mini turkish_movie_sentiment few_shot 📊 cont 0-5 18 0.939 0.697 77.8% 0.798 0.697 -0.58
gpt-4o-mini turkish_movie_sentiment few_shot 🔢 disc 0-5 21 1.119 0.633 81.0% 0.760 0.561 -0.50
gpt-4o-mini turkish_movie_sentiment rubric_based 📊 cont 0-5 18 1.011 0.667 55.6% 0.799 0.642 -0.63
gpt-4o-mini turkish_movie_sentiment rubric_based 🔢 disc 0-5 18 1.083 0.667 83.3% 0.809 0.783 -0.75
gpt-4o-mini turkish_movie_sentiment zero_shot 📊 cont 0-5 21 0.848 0.674 85.7% 0.813 0.704 -0.50
gpt-4o-mini turkish_movie_sentiment zero_shot 🔢 disc 0-5 21 1.071 0.650 85.7% 0.763 0.562 -0.45
gpt-4o-mini turkish_offensive few_shot 📊 cont 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gpt-4o-mini turkish_offensive few_shot 🔢 disc 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gpt-4o-mini turkish_offensive rubric_based 📊 cont 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gpt-4o-mini turkish_offensive rubric_based 🔢 disc 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gpt-4o-mini turkish_offensive zero_shot 📊 cont 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gpt-4o-mini turkish_offensive zero_shot 🔢 disc 0-1 27 0.333 0.342 100.0% 0.350 0.350 -0.11
gpt-4o-mini turkish_sentiment_large few_shot 📊 cont 0-2 21 0.374 0.650 95.2% 0.784 0.704 -0.37
gpt-4o-mini turkish_sentiment_large few_shot 🔢 disc 0-2 24 0.250 0.667 100.0% 0.817 0.800 -0.25
gpt-4o-mini turkish_sentiment_large rubric_based 📊 cont 0-2 21 0.341 0.650 100.0% 0.814 0.758 -0.34
gpt-4o-mini turkish_sentiment_large rubric_based 🔢 disc 0-2 21 0.286 0.650 100.0% 0.806 0.794 -0.29
gpt-4o-mini turkish_sentiment_large zero_shot 📊 cont 0-2 24 0.350 0.667 100.0% 0.814 0.743 -0.35
gpt-4o-mini turkish_sentiment_large zero_shot 🔢 disc 0-2 24 0.250 0.667 100.0% 0.817 0.800 -0.25

🌡️ Temperature Analysis

Comparison across temperature settings. Normalized MAE allows cross-dataset comparison (0-1 scale).

Model Dataset Range Strategy T=0.0 T=0.2 T=1.0
llama-3.3-70b persuade 1-6 zero_shot 0.150 0.150 0.150
llama-3.3-70b turkish_movie_sentiment 0-5 zero_shot 0.186 0.171 0.194
llama-3.3-70b turkish_sentiment_large 0-2 zero_shot 0.150 0.150 0.181
llama-3.1-8b persuade 1-6 zero_shot 0.150 0.120 0.250
llama-3.3-70b turkish_offensive 0-1 zero_shot 0.211 0.263 0.211
llama-3.1-8b turkish_sentiment_large 0-2 zero_shot 0.172 0.175 0.175
llama-3.1-8b turkish_movie_sentiment 0-5 zero_shot 0.218 0.198 0.172
llama-3.1-8b turkish_offensive 0-1 zero_shot 0.526 0.556 0.444
llama-3.2-3b persuade 1-6 zero_shot 0.050 0.050 0.150
llama-3.2-3b turkish_sentiment_large 0-2 zero_shot 0.234 0.219 0.273
llama-3.2-3b turkish_movie_sentiment 0-5 zero_shot 0.121 0.136 0.196
llama-3.2-3b turkish_offensive 0-1 zero_shot 0.611 0.556 0.611
gemma-3-27b persuade 1-6 zero_shot 0.250 0.250 0.215
gemma-3-27b turkish_movie_sentiment 0-5 zero_shot 0.180 0.166 0.180
gemma-3-27b turkish_sentiment_large 0-2 zero_shot 0.155 0.152 0.156
mistral-small persuade 1-6 zero_shot 0.250 0.250 0.250
gemma-3-27b turkish_offensive 0-1 zero_shot 0.333 0.333 0.333
mistral-small turkish_movie_sentiment 0-5 zero_shot 0.193 0.193 0.193
mistral-small turkish_sentiment_large 0-2 zero_shot 0.188 0.188 0.188
mistral-small turkish_offensive 0-1 zero_shot 0.333 0.333 0.333
gpt-4o-mini persuade 1-6 zero_shot 0.150 0.150 0.200
gpt-4o-mini turkish_movie_sentiment 0-5 zero_shot 0.190 0.190 0.196
gpt-4o-mini turkish_offensive 0-1 zero_shot 0.333 0.333 0.333
hermes-3-405b persuade 1-6 zero_shot 0.050 0.050 0.050
hermes-3-405b turkish_movie_sentiment 0-5 zero_shot 0.144 0.158 0.169
hermes-3-405b turkish_offensive 0-1 zero_shot 0.222 0.222 0.278
gpt-4o-mini turkish_sentiment_large 0-2 zero_shot 0.152 0.147 0.152
hermes-3-405b turkish_sentiment_large 0-2 zero_shot 0.195 0.188 0.202
llama-3.3-70b persuade 1-6 few_shot 0.150 0.150 0.190
llama-3.1-8b persuade 1-6 few_shot 0.020 0.120 0.170
llama-3.2-3b persuade 1-6 few_shot 0.025 0.025 0.025
gemma-3-27b persuade 1-6 few_shot 0.150 0.150 0.150
mistral-small persuade 1-6 few_shot 0.250 0.250 0.250
gpt-4o-mini persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.3-70b turkish_offensive 0-1 few_shot 0.222 0.222 0.222
hermes-3-405b persuade 1-6 few_shot 0.050 0.050 0.050
llama-3.1-8b turkish_offensive 0-1 few_shot 0.556 0.556 0.444
llama-3.2-3b turkish_offensive 0-1 few_shot 0.667 0.611 0.611
gemma-3-27b turkish_offensive 0-1 few_shot 0.333 0.333 0.278
mistral-small turkish_offensive 0-1 few_shot 0.333 0.333 0.333
llama-3.3-70b turkish_movie_sentiment 0-5 few_shot 0.200 0.186 0.186
llama-3.1-8b turkish_movie_sentiment 0-5 few_shot 0.221 0.221 0.160
llama-3.3-70b turkish_sentiment_large 0-2 few_shot 0.157 0.157 0.150
gpt-4o-mini turkish_offensive 0-1 few_shot 0.333 0.333 0.333
llama-3.2-3b turkish_movie_sentiment 0-5 few_shot 0.136 0.138 0.215
llama-3.1-8b turkish_sentiment_large 0-2 few_shot 0.177 0.183 0.168
llama-3.2-3b turkish_sentiment_large 0-2 few_shot 0.250 0.300 0.344
gemma-3-27b turkish_movie_sentiment 0-5 few_shot 0.190 0.194 0.194
llama-3.3-70b persuade 1-6 rubric_based 0.390 0.390 0.390
gemma-3-27b turkish_sentiment_large 0-2 few_shot 0.160 0.160 0.162
mistral-small turkish_movie_sentiment 0-5 few_shot 0.200 0.200 0.200
llama-3.1-8b persuade 1-6 rubric_based 0.350 0.350 0.350
llama-3.2-3b persuade 1-6 rubric_based 0.350 0.250 0.050
mistral-small turkish_sentiment_large 0-2 few_shot 0.195 0.195 0.195
gpt-4o-mini turkish_movie_sentiment 0-5 few_shot 0.198 0.198 0.225
gemma-3-27b persuade 1-6 rubric_based 0.390 0.390 0.390
hermes-3-405b turkish_offensive 0-1 few_shot 0.222 0.222 0.222
hermes-3-405b turkish_movie_sentiment 0-5 few_shot 0.151 0.151 0.158
mistral-small persuade 1-6 rubric_based 0.375 0.375 0.375
hermes-3-405b turkish_sentiment_large 0-2 few_shot 0.188 0.188 0.205
gpt-4o-mini turkish_sentiment_large 0-2 few_shot 0.150 0.155 0.157
hermes-3-405b persuade 1-6 rubric_based 0.350 0.350 0.400
gpt-4o-mini persuade 1-6 rubric_based 0.400 0.400 0.350
llama-3.3-70b turkish_offensive 0-1 rubric_based 0.222 0.222 0.333
llama-3.1-8b turkish_offensive 0-1 rubric_based 0.556 0.500 0.444
llama-3.2-3b turkish_offensive 0-1 rubric_based 0.667 0.667 0.588
gemma-3-27b turkish_offensive 0-1 rubric_based 0.333 0.333 0.333
mistral-small turkish_offensive 0-1 rubric_based 0.333 0.333 0.333
llama-3.3-70b persuade 1-6 rag_rubric 0.400 0.400 -
hermes-3-405b turkish_offensive 0-1 rubric_based 0.222 0.222 0.278
gpt-4o-mini turkish_offensive 0-1 rubric_based 0.333 0.333 0.333
llama-3.3-70b turkish_sentiment_large 0-2 rubric_based 0.164 0.164 0.161
llama-3.3-70b turkish_movie_sentiment 0-5 rubric_based 0.203 0.220 0.197
llama-3.1-8b brighter_emotion 0-3 zero_shot 0.208 0.194 0.248
llama-3.3-70b brighter_emotion 0-3 zero_shot 0.213 0.233 0.213
llama-3.1-8b turkish_sentiment_large 0-2 rubric_based 0.154 0.191 0.193
llama-3.1-8b turkish_movie_sentiment 0-5 rubric_based 0.221 0.200 0.232
llama-3.2-3b brighter_emotion 0-3 zero_shot 0.193 0.245 0.177
llama-3.2-3b turkish_sentiment_large 0-2 rubric_based 0.268 0.232 0.326
llama-3.2-3b turkish_movie_sentiment 0-5 rubric_based 0.154 0.163 0.170
gemma-3-27b brighter_emotion 0-3 zero_shot 0.201 0.205 0.194
gemma-3-27b turkish_sentiment_large 0-2 rubric_based 0.171 0.171 0.171
mistral-small brighter_emotion 0-3 zero_shot 0.148 0.148 0.146
mistral-small turkish_sentiment_large 0-2 rubric_based 0.209 0.209 0.209
gemma-3-27b turkish_movie_sentiment 0-5 rubric_based 0.206 0.210 0.206
gpt-4o-mini brighter_emotion 0-3 zero_shot 0.198 0.203 0.171
mistral-small turkish_movie_sentiment 0-5 rubric_based 0.200 0.200 0.200
hermes-3-405b brighter_emotion 0-3 zero_shot 0.151 0.155 0.144
llama-3.3-70b turkish_offensive 0-1 rag_rubric 0.000 0.000 -
gpt-4o-mini turkish_sentiment_large 0-2 rubric_based 0.166 0.166 0.137
hermes-3-405b turkish_sentiment_large 0-2 rubric_based 0.205 0.205 0.211
hermes-3-405b turkish_movie_sentiment 0-5 rubric_based 0.163 0.163 0.172
gpt-4o-mini turkish_movie_sentiment 0-5 rubric_based 0.210 0.213 0.205
llama-3.3-70b brighter_emotion 0-3 few_shot 0.191 0.213 0.232
llama-3.1-8b brighter_emotion 0-3 few_shot 0.219 0.208 0.205
llama-3.2-3b brighter_emotion 0-3 few_shot 0.193 0.203 0.130
mistral-small brighter_emotion 0-3 few_shot 0.148 0.148 0.148
gemma-3-27b brighter_emotion 0-3 few_shot 0.199 0.202 0.203
llama-3.3-70b turkish_sentiment_large 0-2 rag_rubric 0.000 - -
llama-3.3-70b turkish_movie_sentiment 0-5 rag_rubric 0.200 - -
gpt-4o-mini brighter_emotion 0-3 few_shot 0.202 0.203 0.244
hermes-3-405b brighter_emotion 0-3 few_shot 0.145 0.151 0.189
llama-3.3-70b brighter_emotion 0-3 rubric_based 0.232 0.211 0.225
llama-3.1-8b brighter_emotion 0-3 rubric_based 0.224 0.224 0.243
llama-3.2-3b brighter_emotion 0-3 rubric_based 0.208 0.203 0.312
mistral-small brighter_emotion 0-3 rubric_based 0.148 0.150 0.152
gemma-3-27b brighter_emotion 0-3 rubric_based 0.199 0.194 0.188
gpt-4o-mini brighter_emotion 0-3 rubric_based 0.198 0.221 0.222
hermes-3-405b brighter_emotion 0-3 rubric_based 0.149 0.160 0.171
llama-3.3-70b brighter_emotion 0-3 rag_rubric 0.000 - -
Model Dataset Range Strategy T=0.0 T=0.2 T=1.0
llama-3.3-70b persuade 1-6 zero_shot 0.750 0.750 0.750
llama-3.3-70b turkish_movie_sentiment 0-5 zero_shot 0.929 0.857 0.971
llama-3.3-70b turkish_sentiment_large 0-2 zero_shot 0.300 0.300 0.362
llama-3.1-8b persuade 1-6 zero_shot 0.750 0.600 1.250
llama-3.3-70b turkish_offensive 0-1 zero_shot 0.211 0.263 0.211
llama-3.1-8b turkish_sentiment_large 0-2 zero_shot 0.344 0.350 0.350
llama-3.1-8b turkish_movie_sentiment 0-5 zero_shot 1.089 0.989 0.859
llama-3.1-8b turkish_offensive 0-1 zero_shot 0.526 0.556 0.444
llama-3.2-3b persuade 1-6 zero_shot 0.250 0.250 0.750
llama-3.2-3b turkish_sentiment_large 0-2 zero_shot 0.469 0.438 0.547
llama-3.2-3b turkish_movie_sentiment 0-5 zero_shot 0.607 0.679 0.982
llama-3.2-3b turkish_offensive 0-1 zero_shot 0.611 0.556 0.611
gemma-3-27b persuade 1-6 zero_shot 1.250 1.250 1.075
gemma-3-27b turkish_movie_sentiment 0-5 zero_shot 0.900 0.829 0.900
gemma-3-27b turkish_sentiment_large 0-2 zero_shot 0.309 0.303 0.312
mistral-small persuade 1-6 zero_shot 1.250 1.250 1.250
gemma-3-27b turkish_offensive 0-1 zero_shot 0.333 0.333 0.333
mistral-small turkish_movie_sentiment 0-5 zero_shot 0.964 0.964 0.964
mistral-small turkish_sentiment_large 0-2 zero_shot 0.375 0.375 0.375
mistral-small turkish_offensive 0-1 zero_shot 0.333 0.333 0.333
gpt-4o-mini persuade 1-6 zero_shot 0.750 0.750 1.000
gpt-4o-mini turkish_movie_sentiment 0-5 zero_shot 0.950 0.950 0.979
gpt-4o-mini turkish_offensive 0-1 zero_shot 0.333 0.333 0.333
hermes-3-405b persuade 1-6 zero_shot 0.250 0.250 0.250
hermes-3-405b turkish_movie_sentiment 0-5 zero_shot 0.718 0.789 0.843
hermes-3-405b turkish_offensive 0-1 zero_shot 0.222 0.222 0.278
gpt-4o-mini turkish_sentiment_large 0-2 zero_shot 0.303 0.294 0.303
hermes-3-405b turkish_sentiment_large 0-2 zero_shot 0.391 0.375 0.403
llama-3.3-70b persuade 1-6 few_shot 0.750 0.750 0.950
llama-3.1-8b persuade 1-6 few_shot 0.100 0.600 0.850
llama-3.2-3b persuade 1-6 few_shot 0.125 0.125 0.125
gemma-3-27b persuade 1-6 few_shot 0.750 0.750 0.750
mistral-small persuade 1-6 few_shot 1.250 1.250 1.250
gpt-4o-mini persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.3-70b turkish_offensive 0-1 few_shot 0.222 0.222 0.222
hermes-3-405b persuade 1-6 few_shot 0.250 0.250 0.250
llama-3.1-8b turkish_offensive 0-1 few_shot 0.556 0.556 0.444
llama-3.2-3b turkish_offensive 0-1 few_shot 0.667 0.611 0.611
gemma-3-27b turkish_offensive 0-1 few_shot 0.333 0.333 0.278
mistral-small turkish_offensive 0-1 few_shot 0.333 0.333 0.333
llama-3.3-70b turkish_movie_sentiment 0-5 few_shot 1.000 0.929 0.929
llama-3.1-8b turkish_movie_sentiment 0-5 few_shot 1.107 1.104 0.800
llama-3.3-70b turkish_sentiment_large 0-2 few_shot 0.313 0.313 0.300
gpt-4o-mini turkish_offensive 0-1 few_shot 0.333 0.333 0.333
llama-3.2-3b turkish_movie_sentiment 0-5 few_shot 0.679 0.692 1.077
llama-3.1-8b turkish_sentiment_large 0-2 few_shot 0.353 0.367 0.337
llama-3.2-3b turkish_sentiment_large 0-2 few_shot 0.500 0.600 0.689
gemma-3-27b turkish_movie_sentiment 0-5 few_shot 0.950 0.969 0.969
llama-3.3-70b persuade 1-6 rubric_based 1.950 1.950 1.950
gemma-3-27b turkish_sentiment_large 0-2 few_shot 0.320 0.320 0.323
mistral-small turkish_movie_sentiment 0-5 few_shot 1.000 1.000 1.000
llama-3.1-8b persuade 1-6 rubric_based 1.750 1.750 1.750
llama-3.2-3b persuade 1-6 rubric_based 1.750 1.250 0.250
mistral-small turkish_sentiment_large 0-2 few_shot 0.390 0.390 0.390
gpt-4o-mini turkish_movie_sentiment 0-5 few_shot 0.992 0.992 1.123
gemma-3-27b persuade 1-6 rubric_based 1.950 1.950 1.950
hermes-3-405b turkish_offensive 0-1 few_shot 0.222 0.222 0.222
hermes-3-405b turkish_movie_sentiment 0-5 few_shot 0.754 0.754 0.788
mistral-small persuade 1-6 rubric_based 1.875 1.875 1.875
hermes-3-405b turkish_sentiment_large 0-2 few_shot 0.377 0.377 0.410
gpt-4o-mini turkish_sentiment_large 0-2 few_shot 0.300 0.310 0.313
hermes-3-405b persuade 1-6 rubric_based 1.750 1.750 2.000
gpt-4o-mini persuade 1-6 rubric_based 2.000 2.000 1.750
llama-3.3-70b turkish_offensive 0-1 rubric_based 0.222 0.222 0.333
llama-3.1-8b turkish_offensive 0-1 rubric_based 0.556 0.500 0.444
llama-3.2-3b turkish_offensive 0-1 rubric_based 0.667 0.667 0.588
gemma-3-27b turkish_offensive 0-1 rubric_based 0.333 0.333 0.333
mistral-small turkish_offensive 0-1 rubric_based 0.333 0.333 0.333
llama-3.3-70b persuade 1-6 rag_rubric 2.000 2.000 -
hermes-3-405b turkish_offensive 0-1 rubric_based 0.222 0.222 0.278
gpt-4o-mini turkish_offensive 0-1 rubric_based 0.333 0.333 0.333
llama-3.3-70b turkish_sentiment_large 0-2 rubric_based 0.329 0.329 0.321
llama-3.3-70b turkish_movie_sentiment 0-5 rubric_based 1.017 1.100 0.987
llama-3.1-8b brighter_emotion 0-3 zero_shot 0.625 0.581 0.744
llama-3.3-70b brighter_emotion 0-3 zero_shot 0.638 0.700 0.638
llama-3.1-8b turkish_sentiment_large 0-2 rubric_based 0.307 0.382 0.386
llama-3.1-8b turkish_movie_sentiment 0-5 rubric_based 1.104 1.000 1.162
llama-3.2-3b brighter_emotion 0-3 zero_shot 0.578 0.734 0.531
llama-3.2-3b turkish_sentiment_large 0-2 rubric_based 0.536 0.464 0.651
llama-3.2-3b turkish_movie_sentiment 0-5 rubric_based 0.771 0.812 0.852
gemma-3-27b brighter_emotion 0-3 zero_shot 0.603 0.616 0.581
gemma-3-27b turkish_sentiment_large 0-2 rubric_based 0.343 0.343 0.343
mistral-small brighter_emotion 0-3 zero_shot 0.444 0.444 0.438
mistral-small turkish_sentiment_large 0-2 rubric_based 0.418 0.418 0.418
gemma-3-27b turkish_movie_sentiment 0-5 rubric_based 1.029 1.050 1.029
gpt-4o-mini brighter_emotion 0-3 zero_shot 0.594 0.609 0.512
mistral-small turkish_movie_sentiment 0-5 rubric_based 1.000 1.000 1.000
hermes-3-405b brighter_emotion 0-3 zero_shot 0.453 0.466 0.431
llama-3.3-70b turkish_offensive 0-1 rag_rubric 0.000 0.000 -
gpt-4o-mini turkish_sentiment_large 0-2 rubric_based 0.332 0.332 0.275
hermes-3-405b turkish_sentiment_large 0-2 rubric_based 0.411 0.411 0.421
hermes-3-405b turkish_movie_sentiment 0-5 rubric_based 0.817 0.817 0.858
gpt-4o-mini turkish_movie_sentiment 0-5 rubric_based 1.050 1.067 1.025
llama-3.3-70b brighter_emotion 0-3 few_shot 0.572 0.638 0.697
llama-3.1-8b brighter_emotion 0-3 few_shot 0.656 0.625 0.616
llama-3.2-3b brighter_emotion 0-3 few_shot 0.578 0.609 0.391
mistral-small brighter_emotion 0-3 few_shot 0.444 0.444 0.444
gemma-3-27b brighter_emotion 0-3 few_shot 0.597 0.606 0.609
llama-3.3-70b turkish_sentiment_large 0-2 rag_rubric 0.000 - -
llama-3.3-70b turkish_movie_sentiment 0-5 rag_rubric 1.000 - -
gpt-4o-mini brighter_emotion 0-3 few_shot 0.606 0.609 0.731
hermes-3-405b brighter_emotion 0-3 few_shot 0.434 0.453 0.566
llama-3.3-70b brighter_emotion 0-3 rubric_based 0.697 0.634 0.675
llama-3.1-8b brighter_emotion 0-3 rubric_based 0.672 0.672 0.728
llama-3.2-3b brighter_emotion 0-3 rubric_based 0.625 0.609 0.938
mistral-small brighter_emotion 0-3 rubric_based 0.444 0.450 0.457
gemma-3-27b brighter_emotion 0-3 rubric_based 0.597 0.581 0.562
gpt-4o-mini brighter_emotion 0-3 rubric_based 0.593 0.663 0.667
hermes-3-405b brighter_emotion 0-3 rubric_based 0.447 0.480 0.513
llama-3.3-70b brighter_emotion 0-3 rag_rubric 0.000 - -
Model Dataset Range Strategy T=0.0 T=0.2 T=1.0
llama-3.3-70b persuade 1-6 zero_shot 50.0% 50.0% 50.0%
llama-3.3-70b turkish_movie_sentiment 0-5 zero_shot 7.1% 14.3% 7.1%
llama-3.3-70b turkish_sentiment_large 0-2 zero_shot 75.0% 75.0% 75.0%
llama-3.1-8b persuade 1-6 zero_shot 50.0% 50.0% 0.0%
llama-3.3-70b turkish_offensive 0-1 zero_shot 78.9% 73.7% 78.9%
llama-3.1-8b turkish_sentiment_large 0-2 zero_shot 68.8% 68.8% 75.0%
llama-3.1-8b turkish_movie_sentiment 0-5 zero_shot 35.7% 28.6% 21.4%
llama-3.1-8b turkish_offensive 0-1 zero_shot 47.4% 44.4% 55.6%
llama-3.2-3b persuade 1-6 zero_shot 100.0% 100.0% 50.0%
llama-3.2-3b turkish_sentiment_large 0-2 zero_shot 68.8% 68.8% 43.8%
llama-3.2-3b turkish_movie_sentiment 0-5 zero_shot 35.7% 35.7% 21.4%
llama-3.2-3b turkish_offensive 0-1 zero_shot 38.9% 44.4% 38.9%
gemma-3-27b persuade 1-6 zero_shot 0.0% 0.0% 0.0%
gemma-3-27b turkish_movie_sentiment 0-5 zero_shot 28.6% 28.6% 35.7%
gemma-3-27b turkish_sentiment_large 0-2 zero_shot 75.0% 75.0% 75.0%
mistral-small persuade 1-6 zero_shot 0.0% 0.0% 0.0%
gemma-3-27b turkish_offensive 0-1 zero_shot 66.7% 66.7% 66.7%
mistral-small turkish_movie_sentiment 0-5 zero_shot 21.4% 21.4% 21.4%
mistral-small turkish_sentiment_large 0-2 zero_shot 75.0% 75.0% 75.0%
mistral-small turkish_offensive 0-1 zero_shot 66.7% 66.7% 66.7%
gpt-4o-mini persuade 1-6 zero_shot 50.0% 50.0% 0.0%
gpt-4o-mini turkish_movie_sentiment 0-5 zero_shot 21.4% 21.4% 14.3%
gpt-4o-mini turkish_offensive 0-1 zero_shot 66.7% 66.7% 66.7%
hermes-3-405b persuade 1-6 zero_shot 100.0% 100.0% 100.0%
hermes-3-405b turkish_movie_sentiment 0-5 zero_shot 28.6% 21.4% 28.6%
hermes-3-405b turkish_offensive 0-1 zero_shot 77.8% 77.8% 72.2%
gpt-4o-mini turkish_sentiment_large 0-2 zero_shot 75.0% 75.0% 75.0%
hermes-3-405b turkish_sentiment_large 0-2 zero_shot 75.0% 75.0% 75.0%
llama-3.3-70b persuade 1-6 few_shot 50.0% 50.0% 50.0%
llama-3.1-8b persuade 1-6 few_shot 100.0% 50.0% 50.0%
llama-3.2-3b persuade 1-6 few_shot 100.0% 100.0% 100.0%
gemma-3-27b persuade 1-6 few_shot 50.0% 50.0% 50.0%
mistral-small persuade 1-6 few_shot 0.0% 0.0% 0.0%
gpt-4o-mini persuade 1-6 few_shot 100.0% 100.0% 100.0%
llama-3.3-70b turkish_offensive 0-1 few_shot 77.8% 77.8% 77.8%
hermes-3-405b persuade 1-6 few_shot 100.0% 100.0% 100.0%
llama-3.1-8b turkish_offensive 0-1 few_shot 44.4% 44.4% 55.6%
llama-3.2-3b turkish_offensive 0-1 few_shot 33.3% 38.9% 38.9%
gemma-3-27b turkish_offensive 0-1 few_shot 66.7% 66.7% 72.2%
mistral-small turkish_offensive 0-1 few_shot 66.7% 66.7% 66.7%
llama-3.3-70b turkish_movie_sentiment 0-5 few_shot 0.0% 7.1% 7.1%
llama-3.1-8b turkish_movie_sentiment 0-5 few_shot 28.6% 28.6% 30.8%
llama-3.3-70b turkish_sentiment_large 0-2 few_shot 73.3% 73.3% 73.3%
gpt-4o-mini turkish_offensive 0-1 few_shot 66.7% 66.7% 66.7%
llama-3.2-3b turkish_movie_sentiment 0-5 few_shot 28.6% 30.8% 23.1%
llama-3.1-8b turkish_sentiment_large 0-2 few_shot 66.7% 66.7% 66.7%
llama-3.2-3b turkish_sentiment_large 0-2 few_shot 60.0% 53.3% 46.7%
gemma-3-27b turkish_movie_sentiment 0-5 few_shot 23.1% 23.1% 23.1%
llama-3.3-70b persuade 1-6 rubric_based 0.0% 0.0% 0.0%
gemma-3-27b turkish_sentiment_large 0-2 few_shot 73.3% 73.3% 73.3%
mistral-small turkish_movie_sentiment 0-5 few_shot 15.4% 15.4% 15.4%
llama-3.1-8b persuade 1-6 rubric_based 0.0% 0.0% 0.0%
llama-3.2-3b persuade 1-6 rubric_based 0.0% 0.0% 100.0%
mistral-small turkish_sentiment_large 0-2 few_shot 73.3% 73.3% 73.3%
gpt-4o-mini turkish_movie_sentiment 0-5 few_shot 15.4% 15.4% 7.7%
gemma-3-27b persuade 1-6 rubric_based 0.0% 0.0% 0.0%
hermes-3-405b turkish_offensive 0-1 few_shot 77.8% 77.8% 77.8%
hermes-3-405b turkish_movie_sentiment 0-5 few_shot 23.1% 23.1% 23.1%
mistral-small persuade 1-6 rubric_based 0.0% 0.0% 0.0%
hermes-3-405b turkish_sentiment_large 0-2 few_shot 73.3% 73.3% 73.3%
gpt-4o-mini turkish_sentiment_large 0-2 few_shot 73.3% 73.3% 73.3%
hermes-3-405b persuade 1-6 rubric_based 0.0% 0.0% 0.0%
gpt-4o-mini persuade 1-6 rubric_based 0.0% 0.0% 0.0%
llama-3.3-70b turkish_offensive 0-1 rubric_based 77.8% 77.8% 66.7%
llama-3.1-8b turkish_offensive 0-1 rubric_based 44.4% 50.0% 55.6%
llama-3.2-3b turkish_offensive 0-1 rubric_based 33.3% 33.3% 41.2%
gemma-3-27b turkish_offensive 0-1 rubric_based 66.7% 66.7% 66.7%
mistral-small turkish_offensive 0-1 rubric_based 66.7% 66.7% 66.7%
llama-3.3-70b persuade 1-6 rag_rubric 0.0% 0.0% -
hermes-3-405b turkish_offensive 0-1 rubric_based 77.8% 77.8% 72.2%
gpt-4o-mini turkish_offensive 0-1 rubric_based 66.7% 66.7% 66.7%
llama-3.3-70b turkish_sentiment_large 0-2 rubric_based 71.4% 71.4% 71.4%
llama-3.3-70b turkish_movie_sentiment 0-5 rubric_based 0.0% 0.0% 8.3%
llama-3.1-8b brighter_emotion 0-3 zero_shot 31.2% 31.2% 37.5%
llama-3.3-70b brighter_emotion 0-3 zero_shot 31.2% 25.0% 31.2%
llama-3.1-8b turkish_sentiment_large 0-2 rubric_based 71.4% 64.3% 64.3%
llama-3.1-8b turkish_movie_sentiment 0-5 rubric_based 16.7% 25.0% 0.0%
llama-3.2-3b brighter_emotion 0-3 zero_shot 50.0% 25.0% 62.5%
llama-3.2-3b turkish_sentiment_large 0-2 rubric_based 57.1% 64.3% 42.9%
llama-3.2-3b turkish_movie_sentiment 0-5 rubric_based 16.7% 16.7% 16.7%
gemma-3-27b brighter_emotion 0-3 zero_shot 56.2% 56.2% 56.2%
gemma-3-27b turkish_sentiment_large 0-2 rubric_based 71.4% 71.4% 71.4%
mistral-small brighter_emotion 0-3 zero_shot 56.2% 56.2% 56.2%
mistral-small turkish_sentiment_large 0-2 rubric_based 71.4% 71.4% 71.4%
gemma-3-27b turkish_movie_sentiment 0-5 rubric_based 16.7% 16.7% 16.7%
gpt-4o-mini brighter_emotion 0-3 zero_shot 31.2% 31.2% 43.8%
mistral-small turkish_movie_sentiment 0-5 rubric_based 16.7% 16.7% 16.7%
hermes-3-405b brighter_emotion 0-3 zero_shot 56.2% 50.0% 62.5%
llama-3.3-70b turkish_offensive 0-1 rag_rubric 100.0% 100.0% -
gpt-4o-mini turkish_sentiment_large 0-2 rubric_based 71.4% 71.4% 71.4%
hermes-3-405b turkish_sentiment_large 0-2 rubric_based 71.4% 71.4% 71.4%
hermes-3-405b turkish_movie_sentiment 0-5 rubric_based 16.7% 16.7% 16.7%
gpt-4o-mini turkish_movie_sentiment 0-5 rubric_based 16.7% 16.7% 16.7%
llama-3.3-70b brighter_emotion 0-3 few_shot 37.5% 31.2% 25.0%
llama-3.1-8b brighter_emotion 0-3 few_shot 25.0% 37.5% 31.2%
llama-3.2-3b brighter_emotion 0-3 few_shot 50.0% 43.8% 68.8%
mistral-small brighter_emotion 0-3 few_shot 56.2% 56.2% 56.2%
gemma-3-27b brighter_emotion 0-3 few_shot 56.2% 56.2% 56.2%
llama-3.3-70b turkish_sentiment_large 0-2 rag_rubric 100.0% - -
llama-3.3-70b turkish_movie_sentiment 0-5 rag_rubric 0.0% - -
gpt-4o-mini brighter_emotion 0-3 few_shot 31.2% 31.2% 18.8%
hermes-3-405b brighter_emotion 0-3 few_shot 56.2% 56.2% 56.2%
llama-3.3-70b brighter_emotion 0-3 rubric_based 25.0% 31.2% 25.0%
llama-3.1-8b brighter_emotion 0-3 rubric_based 25.0% 31.2% 37.5%
llama-3.2-3b brighter_emotion 0-3 rubric_based 43.8% 43.8% 37.5%
mistral-small brighter_emotion 0-3 rubric_based 56.2% 53.3% 53.3%
gemma-3-27b brighter_emotion 0-3 rubric_based 56.2% 56.2% 56.2%
gpt-4o-mini brighter_emotion 0-3 rubric_based 33.3% 26.7% 26.7%
hermes-3-405b brighter_emotion 0-3 rubric_based 53.3% 46.7% 46.7%
llama-3.3-70b brighter_emotion 0-3 rag_rubric 100.0% - -
Model Dataset Range Strategy T=0.0 T=0.2 T=1.0
llama-3.3-70b persuade 1-6 zero_shot 50.0% 50.0% 50.0%
llama-3.3-70b turkish_movie_sentiment 0-5 zero_shot 78.6% 78.6% 71.4%
llama-3.3-70b turkish_sentiment_large 0-2 zero_shot 100.0% 100.0% 93.8%
llama-3.1-8b persuade 1-6 zero_shot 100.0% 100.0% 50.0%
llama-3.3-70b turkish_offensive 0-1 zero_shot 100.0% 100.0% 100.0%
llama-3.1-8b turkish_sentiment_large 0-2 zero_shot 100.0% 100.0% 93.8%
llama-3.1-8b turkish_movie_sentiment 0-5 zero_shot 64.3% 71.4% 71.4%
llama-3.1-8b turkish_offensive 0-1 zero_shot 100.0% 100.0% 100.0%
llama-3.2-3b persuade 1-6 zero_shot 100.0% 100.0% 100.0%
llama-3.2-3b turkish_sentiment_large 0-2 zero_shot 100.0% 100.0% 100.0%
llama-3.2-3b turkish_movie_sentiment 0-5 zero_shot 92.9% 85.7% 71.4%
llama-3.2-3b turkish_offensive 0-1 zero_shot 100.0% 100.0% 100.0%
gemma-3-27b persuade 1-6 zero_shot 50.0% 50.0% 50.0%
gemma-3-27b turkish_movie_sentiment 0-5 zero_shot 78.6% 85.7% 71.4%
gemma-3-27b turkish_sentiment_large 0-2 zero_shot 93.8% 93.8% 93.8%
mistral-small persuade 1-6 zero_shot 50.0% 50.0% 50.0%
gemma-3-27b turkish_offensive 0-1 zero_shot 100.0% 100.0% 100.0%
mistral-small turkish_movie_sentiment 0-5 zero_shot 64.3% 64.3% 64.3%
mistral-small turkish_sentiment_large 0-2 zero_shot 87.5% 87.5% 87.5%
mistral-small turkish_offensive 0-1 zero_shot 100.0% 100.0% 100.0%
gpt-4o-mini persuade 1-6 zero_shot 100.0% 100.0% 100.0%
gpt-4o-mini turkish_movie_sentiment 0-5 zero_shot 85.7% 85.7% 85.7%
gpt-4o-mini turkish_offensive 0-1 zero_shot 100.0% 100.0% 100.0%
hermes-3-405b persuade 1-6 zero_shot 100.0% 100.0% 100.0%
hermes-3-405b turkish_movie_sentiment 0-5 zero_shot 92.9% 92.9% 78.6%
hermes-3-405b turkish_offensive 0-1 zero_shot 100.0% 100.0% 100.0%
gpt-4o-mini turkish_sentiment_large 0-2 zero_shot 100.0% 100.0% 100.0%
hermes-3-405b turkish_sentiment_large 0-2 zero_shot 87.5% 87.5% 87.5%
llama-3.3-70b persuade 1-6 few_shot 50.0% 50.0% 50.0%
llama-3.1-8b persuade 1-6 few_shot 100.0% 100.0% 50.0%
llama-3.2-3b persuade 1-6 few_shot 100.0% 100.0% 100.0%
gemma-3-27b persuade 1-6 few_shot 50.0% 50.0% 50.0%
mistral-small persuade 1-6 few_shot 50.0% 50.0% 50.0%
gpt-4o-mini persuade 1-6 few_shot 100.0% 100.0% 100.0%
llama-3.3-70b turkish_offensive 0-1 few_shot 100.0% 100.0% 100.0%
hermes-3-405b persuade 1-6 few_shot 100.0% 100.0% 100.0%
llama-3.1-8b turkish_offensive 0-1 few_shot 100.0% 100.0% 100.0%
llama-3.2-3b turkish_offensive 0-1 few_shot 100.0% 100.0% 100.0%
gemma-3-27b turkish_offensive 0-1 few_shot 100.0% 100.0% 100.0%
mistral-small turkish_offensive 0-1 few_shot 100.0% 100.0% 100.0%
llama-3.3-70b turkish_movie_sentiment 0-5 few_shot 78.6% 78.6% 78.6%
llama-3.1-8b turkish_movie_sentiment 0-5 few_shot 71.4% 64.3% 92.3%
llama-3.3-70b turkish_sentiment_large 0-2 few_shot 100.0% 100.0% 100.0%
gpt-4o-mini turkish_offensive 0-1 few_shot 100.0% 100.0% 100.0%
llama-3.2-3b turkish_movie_sentiment 0-5 few_shot 85.7% 84.6% 53.8%
llama-3.1-8b turkish_sentiment_large 0-2 few_shot 100.0% 100.0% 100.0%
llama-3.2-3b turkish_sentiment_large 0-2 few_shot 100.0% 100.0% 93.3%
gemma-3-27b turkish_movie_sentiment 0-5 few_shot 76.9% 69.2% 69.2%
llama-3.3-70b persuade 1-6 rubric_based 0.0% 0.0% 0.0%
gemma-3-27b turkish_sentiment_large 0-2 few_shot 93.3% 93.3% 93.3%
mistral-small turkish_movie_sentiment 0-5 few_shot 61.5% 61.5% 61.5%
llama-3.1-8b persuade 1-6 rubric_based 0.0% 0.0% 0.0%
llama-3.2-3b persuade 1-6 rubric_based 0.0% 50.0% 100.0%
mistral-small turkish_sentiment_large 0-2 few_shot 86.7% 86.7% 86.7%
gpt-4o-mini turkish_movie_sentiment 0-5 few_shot 84.6% 84.6% 69.2%
gemma-3-27b persuade 1-6 rubric_based 0.0% 0.0% 0.0%
hermes-3-405b turkish_offensive 0-1 few_shot 100.0% 100.0% 100.0%
hermes-3-405b turkish_movie_sentiment 0-5 few_shot 92.3% 92.3% 92.3%
mistral-small persuade 1-6 rubric_based 0.0% 0.0% 0.0%
hermes-3-405b turkish_sentiment_large 0-2 few_shot 86.7% 86.7% 86.7%
gpt-4o-mini turkish_sentiment_large 0-2 few_shot 100.0% 100.0% 93.3%
hermes-3-405b persuade 1-6 rubric_based 0.0% 0.0% 0.0%
gpt-4o-mini persuade 1-6 rubric_based 0.0% 0.0% 0.0%
llama-3.3-70b turkish_offensive 0-1 rubric_based 100.0% 100.0% 100.0%
llama-3.1-8b turkish_offensive 0-1 rubric_based 100.0% 100.0% 100.0%
llama-3.2-3b turkish_offensive 0-1 rubric_based 100.0% 100.0% 100.0%
gemma-3-27b turkish_offensive 0-1 rubric_based 100.0% 100.0% 100.0%
mistral-small turkish_offensive 0-1 rubric_based 100.0% 100.0% 100.0%
llama-3.3-70b persuade 1-6 rag_rubric 0.0% 0.0% -
hermes-3-405b turkish_offensive 0-1 rubric_based 100.0% 100.0% 100.0%
gpt-4o-mini turkish_offensive 0-1 rubric_based 100.0% 100.0% 100.0%
llama-3.3-70b turkish_sentiment_large 0-2 rubric_based 100.0% 100.0% 100.0%
llama-3.3-70b turkish_movie_sentiment 0-5 rubric_based 75.0% 66.7% 66.7%
llama-3.1-8b brighter_emotion 0-3 zero_shot 87.5% 93.8% 75.0%
llama-3.3-70b brighter_emotion 0-3 zero_shot 100.0% 100.0% 100.0%
llama-3.1-8b turkish_sentiment_large 0-2 rubric_based 100.0% 100.0% 92.9%
llama-3.1-8b turkish_movie_sentiment 0-5 rubric_based 66.7% 66.7% 75.0%
llama-3.2-3b brighter_emotion 0-3 zero_shot 93.8% 87.5% 81.2%
llama-3.2-3b turkish_sentiment_large 0-2 rubric_based 100.0% 100.0% 100.0%
llama-3.2-3b turkish_movie_sentiment 0-5 rubric_based 83.3% 83.3% 75.0%
gemma-3-27b brighter_emotion 0-3 zero_shot 87.5% 81.2% 81.2%
gemma-3-27b turkish_sentiment_large 0-2 rubric_based 92.9% 92.9% 92.9%
mistral-small brighter_emotion 0-3 zero_shot 100.0% 100.0% 100.0%
mistral-small turkish_sentiment_large 0-2 rubric_based 85.7% 85.7% 85.7%
gemma-3-27b turkish_movie_sentiment 0-5 rubric_based 75.0% 66.7% 75.0%
gpt-4o-mini brighter_emotion 0-3 zero_shot 93.8% 93.8% 93.8%
mistral-small turkish_movie_sentiment 0-5 rubric_based 58.3% 58.3% 58.3%
hermes-3-405b brighter_emotion 0-3 zero_shot 93.8% 93.8% 87.5%
llama-3.3-70b turkish_offensive 0-1 rag_rubric 100.0% 100.0% -
gpt-4o-mini turkish_sentiment_large 0-2 rubric_based 100.0% 100.0% 100.0%
hermes-3-405b turkish_sentiment_large 0-2 rubric_based 85.7% 85.7% 85.7%
hermes-3-405b turkish_movie_sentiment 0-5 rubric_based 91.7% 91.7% 91.7%
gpt-4o-mini turkish_movie_sentiment 0-5 rubric_based 75.0% 66.7% 66.7%
llama-3.3-70b brighter_emotion 0-3 few_shot 100.0% 100.0% 100.0%
llama-3.1-8b brighter_emotion 0-3 few_shot 87.5% 87.5% 93.8%
llama-3.2-3b brighter_emotion 0-3 few_shot 93.8% 93.8% 93.8%
mistral-small brighter_emotion 0-3 few_shot 100.0% 100.0% 100.0%
gemma-3-27b brighter_emotion 0-3 few_shot 87.5% 87.5% 81.2%
llama-3.3-70b turkish_sentiment_large 0-2 rag_rubric 100.0% - -
llama-3.3-70b turkish_movie_sentiment 0-5 rag_rubric 100.0% - -
gpt-4o-mini brighter_emotion 0-3 few_shot 93.8% 93.8% 93.8%
hermes-3-405b brighter_emotion 0-3 few_shot 93.8% 93.8% 93.8%
llama-3.3-70b brighter_emotion 0-3 rubric_based 100.0% 100.0% 100.0%
llama-3.1-8b brighter_emotion 0-3 rubric_based 87.5% 81.2% 81.2%
llama-3.2-3b brighter_emotion 0-3 rubric_based 93.8% 93.8% 68.8%
mistral-small brighter_emotion 0-3 rubric_based 100.0% 100.0% 100.0%
gemma-3-27b brighter_emotion 0-3 rubric_based 87.5% 87.5% 87.5%
gpt-4o-mini brighter_emotion 0-3 rubric_based 93.3% 93.3% 93.3%
hermes-3-405b brighter_emotion 0-3 rubric_based 93.3% 93.3% 86.7%
llama-3.3-70b brighter_emotion 0-3 rag_rubric 100.0% - -
Model Dataset Range Strategy T=0.0 T=0.2 T=1.0
llama-3.3-70b persuade 1-6 zero_shot 0.000 0.000 0.000
llama-3.3-70b turkish_movie_sentiment 0-5 zero_shot 0.667 0.693 0.667
llama-3.3-70b turkish_sentiment_large 0-2 zero_shot 0.667 0.667 0.517
llama-3.1-8b persuade 1-6 zero_shot 0.000 0.000 0.000
llama-3.3-70b turkish_offensive 0-1 zero_shot 0.596 0.486 0.596
llama-3.1-8b turkish_sentiment_large 0-2 zero_shot 0.565 0.565 0.517
llama-3.1-8b turkish_movie_sentiment 0-5 zero_shot 0.072 0.165 0.510
llama-3.1-8b turkish_offensive 0-1 zero_shot -0.118 -0.154 0.100
llama-3.2-3b persuade 1-6 zero_shot 0.000 0.000 0.000
llama-3.2-3b turkish_sentiment_large 0-2 zero_shot 0.565 0.412 0.053
llama-3.2-3b turkish_movie_sentiment 0-5 zero_shot 0.680 0.596 0.390
llama-3.2-3b turkish_offensive 0-1 zero_shot -0.286 -0.154 -0.286
gemma-3-27b persuade 1-6 zero_shot 0.000 0.000 0.000
gemma-3-27b turkish_movie_sentiment 0-5 zero_shot 0.646 0.724 0.628
gemma-3-27b turkish_sentiment_large 0-2 zero_shot 0.667 0.667 0.667
mistral-small persuade 1-6 zero_shot 0.000 0.000 0.000
gemma-3-27b turkish_offensive 0-1 zero_shot 0.341 0.341 0.341
mistral-small turkish_movie_sentiment 0-5 zero_shot 0.657 0.657 0.657
mistral-small turkish_sentiment_large 0-2 zero_shot 0.517 0.517 0.517
mistral-small turkish_offensive 0-1 zero_shot 0.341 0.341 0.341
gpt-4o-mini persuade 1-6 zero_shot 0.000 0.000 0.000
gpt-4o-mini turkish_movie_sentiment 0-5 zero_shot 0.665 0.665 0.656
gpt-4o-mini turkish_offensive 0-1 zero_shot 0.341 0.341 0.341
hermes-3-405b persuade 1-6 zero_shot 0.000 0.000 0.000
hermes-3-405b turkish_movie_sentiment 0-5 zero_shot 0.724 0.709 0.696
hermes-3-405b turkish_offensive 0-1 zero_shot 0.571 0.571 0.458
gpt-4o-mini turkish_sentiment_large 0-2 zero_shot 0.667 0.667 0.667
hermes-3-405b turkish_sentiment_large 0-2 zero_shot 0.412 0.517 0.412
llama-3.3-70b persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.1-8b persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.2-3b persuade 1-6 few_shot 0.000 0.000 0.000
gemma-3-27b persuade 1-6 few_shot 0.000 0.000 0.000
mistral-small persuade 1-6 few_shot 0.000 0.000 0.000
gpt-4o-mini persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.3-70b turkish_offensive 0-1 few_shot 0.571 0.571 0.571
hermes-3-405b persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.1-8b turkish_offensive 0-1 few_shot -0.154 -0.154 0.100
llama-3.2-3b turkish_offensive 0-1 few_shot -0.421 -0.286 -0.286
gemma-3-27b turkish_offensive 0-1 few_shot 0.341 0.341 0.458
mistral-small turkish_offensive 0-1 few_shot 0.341 0.341 0.341
llama-3.3-70b turkish_movie_sentiment 0-5 few_shot 0.639 0.667 0.667
llama-3.1-8b turkish_movie_sentiment 0-5 few_shot 0.085 0.085 0.542
llama-3.3-70b turkish_sentiment_large 0-2 few_shot 0.659 0.659 0.659
gpt-4o-mini turkish_offensive 0-1 few_shot 0.341 0.341 0.341
llama-3.2-3b turkish_movie_sentiment 0-5 few_shot 0.486 0.678 0.223
llama-3.1-8b turkish_sentiment_large 0-2 few_shot 0.556 0.648 0.556
llama-3.2-3b turkish_sentiment_large 0-2 few_shot 0.318 0.426 0.315
gemma-3-27b turkish_movie_sentiment 0-5 few_shot 0.641 0.641 0.641
llama-3.3-70b persuade 1-6 rubric_based 0.000 0.000 0.000
gemma-3-27b turkish_sentiment_large 0-2 few_shot 0.659 0.659 0.659
mistral-small turkish_movie_sentiment 0-5 few_shot 0.653 0.653 0.653
llama-3.1-8b persuade 1-6 rubric_based 0.000 0.000 0.000
llama-3.2-3b persuade 1-6 rubric_based 0.000 0.000 0.000
mistral-small turkish_sentiment_large 0-2 few_shot 0.507 0.507 0.507
gpt-4o-mini turkish_movie_sentiment 0-5 few_shot 0.681 0.681 0.627
gemma-3-27b persuade 1-6 rubric_based 0.000 0.000 0.000
hermes-3-405b turkish_offensive 0-1 few_shot 0.571 0.571 0.571
hermes-3-405b turkish_movie_sentiment 0-5 few_shot 0.721 0.721 0.721
mistral-small persuade 1-6 rubric_based 0.000 0.000 0.000
hermes-3-405b turkish_sentiment_large 0-2 few_shot 0.507 0.507 0.400
gpt-4o-mini turkish_sentiment_large 0-2 few_shot 0.659 0.659 0.659
hermes-3-405b persuade 1-6 rubric_based 0.000 0.000 0.000
gpt-4o-mini persuade 1-6 rubric_based 0.000 0.000 0.000
llama-3.3-70b turkish_offensive 0-1 rubric_based 0.571 0.571 0.341
llama-3.1-8b turkish_offensive 0-1 rubric_based -0.154 -0.025 0.100
llama-3.2-3b turkish_offensive 0-1 rubric_based -0.421 -0.421 -0.269
gemma-3-27b turkish_offensive 0-1 rubric_based 0.341 0.341 0.341
mistral-small turkish_offensive 0-1 rubric_based 0.341 0.341 0.341
llama-3.3-70b persuade 1-6 rag_rubric - - -
hermes-3-405b turkish_offensive 0-1 rubric_based 0.571 0.571 0.458
gpt-4o-mini turkish_offensive 0-1 rubric_based 0.341 0.341 0.341
llama-3.3-70b turkish_sentiment_large 0-2 rubric_based 0.650 0.650 0.650
llama-3.3-70b turkish_movie_sentiment 0-5 rubric_based 0.649 0.604 0.679
llama-3.1-8b brighter_emotion 0-3 zero_shot 0.569 0.679 0.429
llama-3.3-70b brighter_emotion 0-3 zero_shot 0.701 0.657 0.683
llama-3.1-8b turkish_sentiment_large 0-2 rubric_based 0.650 0.639 0.545
llama-3.1-8b turkish_movie_sentiment 0-5 rubric_based 0.000 0.242 0.045
llama-3.2-3b brighter_emotion 0-3 zero_shot 0.757 0.581 0.667
llama-3.2-3b turkish_sentiment_large 0-2 rubric_based 0.300 0.386 0.440
llama-3.2-3b turkish_movie_sentiment 0-5 rubric_based 0.480 0.480 0.455
gemma-3-27b brighter_emotion 0-3 zero_shot 0.429 0.429 0.565
gemma-3-27b turkish_sentiment_large 0-2 rubric_based 0.650 0.650 0.650
mistral-small brighter_emotion 0-3 zero_shot 0.799 0.799 0.799
mistral-small turkish_sentiment_large 0-2 rubric_based 0.495 0.495 0.495
gemma-3-27b turkish_movie_sentiment 0-5 rubric_based 0.636 0.636 0.636
gpt-4o-mini brighter_emotion 0-3 zero_shot 0.617 0.617 0.657
mistral-small turkish_movie_sentiment 0-5 rubric_based 0.657 0.657 0.657
hermes-3-405b brighter_emotion 0-3 zero_shot 0.765 0.729 0.806
llama-3.3-70b turkish_offensive 0-1 rag_rubric - - -
gpt-4o-mini turkish_sentiment_large 0-2 rubric_based 0.650 0.650 0.650
hermes-3-405b turkish_sentiment_large 0-2 rubric_based 0.495 0.495 0.386
hermes-3-405b turkish_movie_sentiment 0-5 rubric_based 0.717 0.717 0.717
gpt-4o-mini turkish_movie_sentiment 0-5 rubric_based 0.667 0.667 0.667
llama-3.3-70b brighter_emotion 0-3 few_shot 0.726 0.701 0.657
llama-3.1-8b brighter_emotion 0-3 few_shot 0.535 0.563 0.611
llama-3.2-3b brighter_emotion 0-3 few_shot 0.727 0.721 0.781
mistral-small brighter_emotion 0-3 few_shot 0.799 0.799 0.799
gemma-3-27b brighter_emotion 0-3 few_shot 0.429 0.429 0.429
llama-3.3-70b turkish_sentiment_large 0-2 rag_rubric - - -
llama-3.3-70b turkish_movie_sentiment 0-5 rag_rubric - - -
gpt-4o-mini brighter_emotion 0-3 few_shot 0.617 0.617 0.485
hermes-3-405b brighter_emotion 0-3 few_shot 0.765 0.765 0.776
llama-3.3-70b brighter_emotion 0-3 rubric_based 0.657 0.701 0.631
llama-3.1-8b brighter_emotion 0-3 rubric_based 0.535 0.588 0.478
llama-3.2-3b brighter_emotion 0-3 rubric_based 0.782 0.721 0.302
mistral-small brighter_emotion 0-3 rubric_based 0.799 0.795 0.795
gemma-3-27b brighter_emotion 0-3 rubric_based 0.429 0.429 0.565
gpt-4o-mini brighter_emotion 0-3 rubric_based 0.643 0.580 0.580
hermes-3-405b brighter_emotion 0-3 rubric_based 0.762 0.726 0.726
llama-3.3-70b brighter_emotion 0-3 rag_rubric - - -
Model Dataset Range Strategy T=0.0 T=0.2 T=1.0
llama-3.3-70b persuade 1-6 zero_shot 0.000 0.000 0.000
llama-3.3-70b turkish_movie_sentiment 0-5 zero_shot 0.639 0.752 0.658
llama-3.3-70b turkish_sentiment_large 0-2 zero_shot 0.703 0.715 0.667
llama-3.1-8b persuade 1-6 zero_shot 0.000 0.000 0.000
llama-3.3-70b turkish_offensive 0-1 zero_shot 0.651 0.511 0.651
llama-3.1-8b turkish_sentiment_large 0-2 zero_shot 0.476 0.475 0.678
llama-3.1-8b turkish_movie_sentiment 0-5 zero_shot 0.257 0.277 0.441
llama-3.1-8b turkish_offensive 0-1 zero_shot -0.121 -0.158 0.100
llama-3.2-3b persuade 1-6 zero_shot 0.000 0.000 0.000
llama-3.2-3b turkish_sentiment_large 0-2 zero_shot 0.596 0.440 0.065
llama-3.2-3b turkish_movie_sentiment 0-5 zero_shot 0.703 0.516 0.400
llama-3.2-3b turkish_offensive 0-1 zero_shot -0.305 -0.158 -0.305
gemma-3-27b persuade 1-6 zero_shot 0.000 0.000 0.000
gemma-3-27b turkish_movie_sentiment 0-5 zero_shot 0.407 0.471 0.349
gemma-3-27b turkish_sentiment_large 0-2 zero_shot 0.672 0.672 0.672
mistral-small persuade 1-6 zero_shot 0.000 0.000 0.000
gemma-3-27b turkish_offensive 0-1 zero_shot 0.350 0.350 0.350
mistral-small turkish_movie_sentiment 0-5 zero_shot 0.652 0.652 0.652
mistral-small turkish_sentiment_large 0-2 zero_shot 0.591 0.591 0.591
mistral-small turkish_offensive 0-1 zero_shot 0.350 0.350 0.350
gpt-4o-mini persuade 1-6 zero_shot 0.000 0.000 0.000
gpt-4o-mini turkish_movie_sentiment 0-5 zero_shot 0.602 0.602 0.595
gpt-4o-mini turkish_offensive 0-1 zero_shot 0.350 0.350 0.350
hermes-3-405b persuade 1-6 zero_shot 0.000 0.000 0.000
hermes-3-405b turkish_movie_sentiment 0-5 zero_shot 0.668 0.608 0.568
hermes-3-405b turkish_offensive 0-1 zero_shot 0.632 0.632 0.484
gpt-4o-mini turkish_sentiment_large 0-2 zero_shot 0.710 0.719 0.710
hermes-3-405b turkish_sentiment_large 0-2 zero_shot 0.576 0.591 0.591
llama-3.3-70b persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.1-8b persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.2-3b persuade 1-6 few_shot 0.000 0.000 0.000
gemma-3-27b persuade 1-6 few_shot 0.000 0.000 0.000
mistral-small persuade 1-6 few_shot 0.000 0.000 0.000
gpt-4o-mini persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.3-70b turkish_offensive 0-1 few_shot 0.632 0.632 0.632
hermes-3-405b persuade 1-6 few_shot 0.000 0.000 0.000
llama-3.1-8b turkish_offensive 0-1 few_shot -0.158 -0.158 0.100
llama-3.2-3b turkish_offensive 0-1 few_shot -0.478 -0.305 -0.305
gemma-3-27b turkish_offensive 0-1 few_shot 0.350 0.350 0.484
mistral-small turkish_offensive 0-1 few_shot 0.350 0.350 0.350
llama-3.3-70b turkish_movie_sentiment 0-5 few_shot 0.529 0.639 0.639
llama-3.1-8b turkish_movie_sentiment 0-5 few_shot 0.237 0.148 0.435
llama-3.3-70b turkish_sentiment_large 0-2 few_shot 0.715 0.715 0.724
gpt-4o-mini turkish_offensive 0-1 few_shot 0.350 0.350 0.350
llama-3.2-3b turkish_movie_sentiment 0-5 few_shot 0.675 0.704 0.135
llama-3.1-8b turkish_sentiment_large 0-2 few_shot 0.498 0.768 0.572
llama-3.2-3b turkish_sentiment_large 0-2 few_shot 0.354 0.497 0.462
gemma-3-27b turkish_movie_sentiment 0-5 few_shot 0.447 0.391 0.391
llama-3.3-70b persuade 1-6 rubric_based 0.000 0.000 0.000
gemma-3-27b turkish_sentiment_large 0-2 few_shot 0.678 0.678 0.678
mistral-small turkish_movie_sentiment 0-5 few_shot 0.702 0.702 0.702
llama-3.1-8b persuade 1-6 rubric_based 0.000 0.000 0.000
llama-3.2-3b persuade 1-6 rubric_based 0.000 0.000 0.000
mistral-small turkish_sentiment_large 0-2 few_shot 0.590 0.590 0.590
gpt-4o-mini turkish_movie_sentiment 0-5 few_shot 0.657 0.657 0.528
gemma-3-27b persuade 1-6 rubric_based 0.000 0.000 0.000
hermes-3-405b turkish_offensive 0-1 few_shot 0.632 0.632 0.632
hermes-3-405b turkish_movie_sentiment 0-5 few_shot 0.725 0.725 0.736
mistral-small persuade 1-6 rubric_based 0.000 0.000 0.000
hermes-3-405b turkish_sentiment_large 0-2 few_shot 0.590 0.590 0.588
gpt-4o-mini turkish_sentiment_large 0-2 few_shot 0.731 0.720 0.678
hermes-3-405b persuade 1-6 rubric_based 0.000 0.000 0.000
gpt-4o-mini persuade 1-6 rubric_based 0.000 0.000 0.000
llama-3.3-70b turkish_offensive 0-1 rubric_based 0.632 0.632 0.350
llama-3.1-8b turkish_offensive 0-1 rubric_based -0.158 -0.025 0.100
llama-3.2-3b turkish_offensive 0-1 rubric_based -0.478 -0.478 -0.278
gemma-3-27b turkish_offensive 0-1 rubric_based 0.350 0.350 0.350
mistral-small turkish_offensive 0-1 rubric_based 0.350 0.350 0.350
llama-3.3-70b persuade 1-6 rag_rubric - - -
hermes-3-405b turkish_offensive 0-1 rubric_based 0.632 0.632 0.484
gpt-4o-mini turkish_offensive 0-1 rubric_based 0.350 0.350 0.350
llama-3.3-70b turkish_sentiment_large 0-2 rubric_based 0.728 0.718 0.725
llama-3.3-70b turkish_movie_sentiment 0-5 rubric_based 0.620 0.452 0.735
llama-3.1-8b brighter_emotion 0-3 zero_shot 0.675 0.746 0.559
llama-3.3-70b brighter_emotion 0-3 zero_shot 0.763 0.744 0.778
llama-3.1-8b turkish_sentiment_large 0-2 rubric_based 0.767 0.787 0.450
llama-3.1-8b turkish_movie_sentiment 0-5 rubric_based 0.156 0.224 0.044
llama-3.2-3b brighter_emotion 0-3 zero_shot 0.844 0.747 0.618
llama-3.2-3b turkish_sentiment_large 0-2 rubric_based 0.319 0.423 0.568
llama-3.2-3b turkish_movie_sentiment 0-5 rubric_based 0.607 0.566 0.486
gemma-3-27b brighter_emotion 0-3 zero_shot 0.476 0.466 0.493
gemma-3-27b turkish_sentiment_large 0-2 rubric_based 0.678 0.678 0.678
mistral-small brighter_emotion 0-3 zero_shot 0.869 0.869 0.877
mistral-small turkish_sentiment_large 0-2 rubric_based 0.579 0.579 0.579
gemma-3-27b turkish_movie_sentiment 0-5 rubric_based 0.479 0.430 0.479
gpt-4o-mini brighter_emotion 0-3 zero_shot 0.632 0.599 0.642
mistral-small turkish_movie_sentiment 0-5 rubric_based 0.767 0.767 0.767
hermes-3-405b brighter_emotion 0-3 zero_shot 0.852 0.800 0.823
llama-3.3-70b turkish_offensive 0-1 rag_rubric - - -
gpt-4o-mini turkish_sentiment_large 0-2 rubric_based 0.725 0.725 0.767
hermes-3-405b turkish_sentiment_large 0-2 rubric_based 0.578 0.578 0.560
hermes-3-405b turkish_movie_sentiment 0-5 rubric_based 0.736 0.736 0.738
gpt-4o-mini turkish_movie_sentiment 0-5 rubric_based 0.747 0.616 0.602
llama-3.3-70b brighter_emotion 0-3 few_shot 0.802 0.763 0.744
llama-3.1-8b brighter_emotion 0-3 few_shot 0.663 0.683 0.801
llama-3.2-3b brighter_emotion 0-3 few_shot 0.795 0.798 0.829
mistral-small brighter_emotion 0-3 few_shot 0.869 0.869 0.869
gemma-3-27b brighter_emotion 0-3 few_shot 0.511 0.526 0.451
llama-3.3-70b turkish_sentiment_large 0-2 rag_rubric - - -
llama-3.3-70b turkish_movie_sentiment 0-5 rag_rubric - - -
gpt-4o-mini brighter_emotion 0-3 few_shot 0.613 0.599 0.586
hermes-3-405b brighter_emotion 0-3 few_shot 0.810 0.770 0.727
llama-3.3-70b brighter_emotion 0-3 rubric_based 0.744 0.763 0.764
llama-3.1-8b brighter_emotion 0-3 rubric_based 0.662 0.644 0.569
llama-3.2-3b brighter_emotion 0-3 rubric_based 0.851 0.798 0.261
mistral-small brighter_emotion 0-3 rubric_based 0.869 0.859 0.850
gemma-3-27b brighter_emotion 0-3 rubric_based 0.511 0.532 0.601
gpt-4o-mini brighter_emotion 0-3 rubric_based 0.606 0.587 0.548
hermes-3-405b brighter_emotion 0-3 rubric_based 0.799 0.796 0.737
llama-3.3-70b brighter_emotion 0-3 rag_rubric - - -

🔍 RAG Strategy Performance

Comparison between RAG-enhanced strategies and baseline strategies. Normalized MAE shown (0-1 scale).

Model Dataset Range few_shot rubric_based rag_rubric
llama-3.3-70b persuade 1-6 0.163 0.390 0.400
llama-3.3-70b turkish_movie_sentiment 0-5 0.190 0.207 0.200
llama-3.3-70b turkish_sentiment_large 0-2 0.154 0.163 0.000
llama-3.1-8b persuade 1-6 0.103 0.350 -
llama-3.3-70b turkish_offensive 0-1 0.222 0.259 0.000
llama-3.1-8b turkish_sentiment_large 0-2 0.176 0.179 -
llama-3.1-8b turkish_movie_sentiment 0-5 0.202 0.218 -
llama-3.1-8b turkish_offensive 0-1 0.519 0.500 -
llama-3.2-3b persuade 1-6 0.025 0.217 -
llama-3.2-3b turkish_sentiment_large 0-2 0.298 0.275 -
llama-3.2-3b turkish_movie_sentiment 0-5 0.163 0.162 -
llama-3.2-3b turkish_offensive 0-1 0.630 0.642 -
gemma-3-27b persuade 1-6 0.150 0.390 -
gemma-3-27b turkish_movie_sentiment 0-5 0.193 0.207 -
gemma-3-27b turkish_sentiment_large 0-2 0.161 0.171 -
mistral-small persuade 1-6 0.250 0.375 -
gemma-3-27b turkish_offensive 0-1 0.315 0.333 -
mistral-small turkish_movie_sentiment 0-5 0.200 0.200 -
mistral-small turkish_sentiment_large 0-2 0.195 0.209 -
mistral-small turkish_offensive 0-1 0.333 0.333 -
gpt-4o-mini persuade 1-6 0.000 0.383 -
gpt-4o-mini turkish_movie_sentiment 0-5 0.207 0.209 -
gpt-4o-mini turkish_offensive 0-1 0.333 0.333 -
hermes-3-405b persuade 1-6 0.050 0.367 -
hermes-3-405b turkish_movie_sentiment 0-5 0.153 0.166 -
hermes-3-405b turkish_offensive 0-1 0.222 0.241 -
gpt-4o-mini turkish_sentiment_large 0-2 0.154 0.157 -
hermes-3-405b turkish_sentiment_large 0-2 0.194 0.207 -
llama-3.1-8b brighter_emotion 0-3 0.211 0.230 -
llama-3.3-70b brighter_emotion 0-3 0.212 0.223 0.000
llama-3.2-3b brighter_emotion 0-3 0.175 0.241 -
gemma-3-27b brighter_emotion 0-3 0.201 0.193 -
mistral-small brighter_emotion 0-3 0.148 0.150 -
gpt-4o-mini brighter_emotion 0-3 0.216 0.214 -
hermes-3-405b brighter_emotion 0-3 0.161 0.160 -
Model Dataset Range few_shot rubric_based rag_rubric
llama-3.3-70b persuade 1-6 0.817 1.950 2.000
llama-3.3-70b turkish_movie_sentiment 0-5 0.952 1.035 1.000
llama-3.3-70b turkish_sentiment_large 0-2 0.309 0.326 0.000
llama-3.1-8b persuade 1-6 0.517 1.750 -
llama-3.3-70b turkish_offensive 0-1 0.222 0.259 0.000
llama-3.1-8b turkish_sentiment_large 0-2 0.352 0.358 -
llama-3.1-8b turkish_movie_sentiment 0-5 1.009 1.089 -
llama-3.1-8b turkish_offensive 0-1 0.519 0.500 -
llama-3.2-3b persuade 1-6 0.125 1.083 -
llama-3.2-3b turkish_sentiment_large 0-2 0.596 0.550 -
llama-3.2-3b turkish_movie_sentiment 0-5 0.812 0.812 -
llama-3.2-3b turkish_offensive 0-1 0.630 0.642 -
gemma-3-27b persuade 1-6 0.750 1.950 -
gemma-3-27b turkish_movie_sentiment 0-5 0.963 1.036 -
gemma-3-27b turkish_sentiment_large 0-2 0.321 0.343 -
mistral-small persuade 1-6 1.250 1.875 -
gemma-3-27b turkish_offensive 0-1 0.315 0.333 -
mistral-small turkish_movie_sentiment 0-5 1.000 1.000 -
mistral-small turkish_sentiment_large 0-2 0.390 0.418 -
mistral-small turkish_offensive 0-1 0.333 0.333 -
gpt-4o-mini persuade 1-6 0.000 1.917 -
gpt-4o-mini turkish_movie_sentiment 0-5 1.036 1.047 -
gpt-4o-mini turkish_offensive 0-1 0.333 0.333 -
hermes-3-405b persuade 1-6 0.250 1.833 -
hermes-3-405b turkish_movie_sentiment 0-5 0.765 0.831 -
hermes-3-405b turkish_offensive 0-1 0.222 0.241 -
gpt-4o-mini turkish_sentiment_large 0-2 0.308 0.313 -
hermes-3-405b turkish_sentiment_large 0-2 0.388 0.414 -
llama-3.1-8b brighter_emotion 0-3 0.632 0.691 -
llama-3.3-70b brighter_emotion 0-3 0.635 0.669 0.000
llama-3.2-3b brighter_emotion 0-3 0.526 0.724 -
gemma-3-27b brighter_emotion 0-3 0.604 0.580 -
mistral-small brighter_emotion 0-3 0.444 0.450 -
gpt-4o-mini brighter_emotion 0-3 0.649 0.641 -
hermes-3-405b brighter_emotion 0-3 0.484 0.480 -
Model Dataset Range few_shot rubric_based rag_rubric
llama-3.3-70b persuade 1-6 50.0% 0.0% 0.0%
llama-3.3-70b turkish_movie_sentiment 0-5 4.8% 2.8% 0.0%
llama-3.3-70b turkish_sentiment_large 0-2 73.3% 71.4% 100.0%
llama-3.1-8b persuade 1-6 66.7% 0.0% -
llama-3.3-70b turkish_offensive 0-1 77.8% 74.1% 100.0%
llama-3.1-8b turkish_sentiment_large 0-2 66.7% 66.7% -
llama-3.1-8b turkish_movie_sentiment 0-5 29.3% 13.9% -
llama-3.1-8b turkish_offensive 0-1 48.1% 50.0% -
llama-3.2-3b persuade 1-6 100.0% 33.3% -
llama-3.2-3b turkish_sentiment_large 0-2 53.3% 54.8% -
llama-3.2-3b turkish_movie_sentiment 0-5 27.5% 16.7% -
llama-3.2-3b turkish_offensive 0-1 37.0% 35.8% -
gemma-3-27b persuade 1-6 50.0% 0.0% -
gemma-3-27b turkish_movie_sentiment 0-5 23.1% 16.7% -
gemma-3-27b turkish_sentiment_large 0-2 73.3% 71.4% -
mistral-small persuade 1-6 0.0% 0.0% -
gemma-3-27b turkish_offensive 0-1 68.5% 66.7% -
mistral-small turkish_movie_sentiment 0-5 15.4% 16.7% -
mistral-small turkish_sentiment_large 0-2 73.3% 71.4% -
mistral-small turkish_offensive 0-1 66.7% 66.7% -
gpt-4o-mini persuade 1-6 100.0% 0.0% -
gpt-4o-mini turkish_movie_sentiment 0-5 12.8% 16.7% -
gpt-4o-mini turkish_offensive 0-1 66.7% 66.7% -
hermes-3-405b persuade 1-6 100.0% 0.0% -
hermes-3-405b turkish_movie_sentiment 0-5 23.1% 16.7% -
hermes-3-405b turkish_offensive 0-1 77.8% 75.9% -
gpt-4o-mini turkish_sentiment_large 0-2 73.3% 71.4% -
hermes-3-405b turkish_sentiment_large 0-2 73.3% 71.4% -
llama-3.1-8b brighter_emotion 0-3 31.2% 31.2% -
llama-3.3-70b brighter_emotion 0-3 31.2% 27.1% 100.0%
llama-3.2-3b brighter_emotion 0-3 54.2% 41.7% -
gemma-3-27b brighter_emotion 0-3 56.2% 56.2% -
mistral-small brighter_emotion 0-3 56.2% 54.3% -
gpt-4o-mini brighter_emotion 0-3 27.1% 28.9% -
hermes-3-405b brighter_emotion 0-3 56.2% 48.9% -
Model Dataset Range few_shot rubric_based rag_rubric
llama-3.3-70b persuade 1-6 50.0% 0.0% 0.0%
llama-3.3-70b turkish_movie_sentiment 0-5 78.6% 69.4% 100.0%
llama-3.3-70b turkish_sentiment_large 0-2 100.0% 100.0% 100.0%
llama-3.1-8b persuade 1-6 83.3% 0.0% -
llama-3.3-70b turkish_offensive 0-1 100.0% 100.0% 100.0%
llama-3.1-8b turkish_sentiment_large 0-2 100.0% 97.6% -
llama-3.1-8b turkish_movie_sentiment 0-5 75.6% 69.4% -
llama-3.1-8b turkish_offensive 0-1 100.0% 100.0% -
llama-3.2-3b persuade 1-6 100.0% 50.0% -
llama-3.2-3b turkish_sentiment_large 0-2 97.8% 100.0% -
llama-3.2-3b turkish_movie_sentiment 0-5 75.0% 80.6% -
llama-3.2-3b turkish_offensive 0-1 100.0% 100.0% -
gemma-3-27b persuade 1-6 50.0% 0.0% -
gemma-3-27b turkish_movie_sentiment 0-5 71.8% 72.2% -
gemma-3-27b turkish_sentiment_large 0-2 93.3% 92.9% -
mistral-small persuade 1-6 50.0% 0.0% -
gemma-3-27b turkish_offensive 0-1 100.0% 100.0% -
mistral-small turkish_movie_sentiment 0-5 61.5% 58.3% -
mistral-small turkish_sentiment_large 0-2 86.7% 85.7% -
mistral-small turkish_offensive 0-1 100.0% 100.0% -
gpt-4o-mini persuade 1-6 100.0% 0.0% -
gpt-4o-mini turkish_movie_sentiment 0-5 79.5% 69.4% -
gpt-4o-mini turkish_offensive 0-1 100.0% 100.0% -
hermes-3-405b persuade 1-6 100.0% 0.0% -
hermes-3-405b turkish_movie_sentiment 0-5 92.3% 91.7% -
hermes-3-405b turkish_offensive 0-1 100.0% 100.0% -
gpt-4o-mini turkish_sentiment_large 0-2 97.8% 100.0% -
hermes-3-405b turkish_sentiment_large 0-2 86.7% 85.7% -
llama-3.1-8b brighter_emotion 0-3 89.6% 83.3% -
llama-3.3-70b brighter_emotion 0-3 100.0% 100.0% 100.0%
llama-3.2-3b brighter_emotion 0-3 93.8% 85.4% -
gemma-3-27b brighter_emotion 0-3 85.4% 87.5% -
mistral-small brighter_emotion 0-3 100.0% 100.0% -
gpt-4o-mini brighter_emotion 0-3 93.8% 93.3% -
hermes-3-405b brighter_emotion 0-3 93.8% 91.1% -
Model Dataset Range few_shot rubric_based rag_rubric
llama-3.3-70b persuade 1-6 0.000 0.000 0.000
llama-3.3-70b turkish_movie_sentiment 0-5 0.658 0.644 -
llama-3.3-70b turkish_sentiment_large 0-2 0.659 0.650 -
llama-3.1-8b persuade 1-6 0.000 0.000 -
llama-3.3-70b turkish_offensive 0-1 0.571 0.496 0.000
llama-3.1-8b turkish_sentiment_large 0-2 0.592 0.614 -
llama-3.1-8b turkish_movie_sentiment 0-5 0.238 0.097 -
llama-3.1-8b turkish_offensive 0-1 -0.068 -0.025 -
llama-3.2-3b persuade 1-6 0.000 0.000 -
llama-3.2-3b turkish_sentiment_large 0-2 0.353 0.387 -
llama-3.2-3b turkish_movie_sentiment 0-5 0.461 0.471 -
llama-3.2-3b turkish_offensive 0-1 -0.330 -0.376 -
gemma-3-27b persuade 1-6 0.000 0.000 -
gemma-3-27b turkish_movie_sentiment 0-5 0.641 0.636 -
gemma-3-27b turkish_sentiment_large 0-2 0.659 0.650 -
mistral-small persuade 1-6 0.000 0.000 -
gemma-3-27b turkish_offensive 0-1 0.381 0.341 -
mistral-small turkish_movie_sentiment 0-5 0.653 0.657 -
mistral-small turkish_sentiment_large 0-2 0.507 0.495 -
mistral-small turkish_offensive 0-1 0.341 0.341 -
gpt-4o-mini persuade 1-6 0.000 0.000 -
gpt-4o-mini turkish_movie_sentiment 0-5 0.660 0.667 -
gpt-4o-mini turkish_offensive 0-1 0.341 0.341 -
hermes-3-405b persuade 1-6 0.000 0.000 -
hermes-3-405b turkish_movie_sentiment 0-5 0.721 0.717 -
hermes-3-405b turkish_offensive 0-1 0.571 0.534 -
gpt-4o-mini turkish_sentiment_large 0-2 0.659 0.650 -
hermes-3-405b turkish_sentiment_large 0-2 0.467 0.455 -
llama-3.1-8b brighter_emotion 0-3 0.568 0.533 -
llama-3.3-70b brighter_emotion 0-3 0.695 0.664 -
llama-3.2-3b brighter_emotion 0-3 0.741 0.629 -
gemma-3-27b brighter_emotion 0-3 0.429 0.474 -
mistral-small brighter_emotion 0-3 0.799 0.796 -
gpt-4o-mini brighter_emotion 0-3 0.577 0.602 -
hermes-3-405b brighter_emotion 0-3 0.769 0.738 -
Model Dataset Range few_shot rubric_based rag_rubric
llama-3.3-70b persuade 1-6 0.000 0.000 0.000
llama-3.3-70b turkish_movie_sentiment 0-5 0.602 0.602 -
llama-3.3-70b turkish_sentiment_large 0-2 0.717 0.723 -
llama-3.1-8b persuade 1-6 0.000 0.000 -
llama-3.3-70b turkish_offensive 0-1 0.632 0.532 0.000
llama-3.1-8b turkish_sentiment_large 0-2 0.600 0.663 -
llama-3.1-8b turkish_movie_sentiment 0-5 0.240 0.117 -
llama-3.1-8b turkish_offensive 0-1 -0.069 -0.025 -
llama-3.2-3b persuade 1-6 0.000 0.000 -
llama-3.2-3b turkish_sentiment_large 0-2 0.428 0.424 -
llama-3.2-3b turkish_movie_sentiment 0-5 0.519 0.545 -
llama-3.2-3b turkish_offensive 0-1 -0.359 -0.411 -
gemma-3-27b persuade 1-6 0.000 0.000 -
gemma-3-27b turkish_movie_sentiment 0-5 0.409 0.462 -
gemma-3-27b turkish_sentiment_large 0-2 0.678 0.678 -
mistral-small persuade 1-6 0.000 0.000 -
gemma-3-27b turkish_offensive 0-1 0.394 0.350 -
mistral-small turkish_movie_sentiment 0-5 0.702 0.767 -
mistral-small turkish_sentiment_large 0-2 0.590 0.579 -
mistral-small turkish_offensive 0-1 0.350 0.350 -
gpt-4o-mini persuade 1-6 0.000 0.000 -
gpt-4o-mini turkish_movie_sentiment 0-5 0.604 0.646 -
gpt-4o-mini turkish_offensive 0-1 0.350 0.350 -
hermes-3-405b persuade 1-6 0.000 0.000 -
hermes-3-405b turkish_movie_sentiment 0-5 0.728 0.736 -
hermes-3-405b turkish_offensive 0-1 0.632 0.581 -
gpt-4o-mini turkish_sentiment_large 0-2 0.708 0.738 -
hermes-3-405b turkish_sentiment_large 0-2 0.585 0.571 -
llama-3.1-8b brighter_emotion 0-3 0.717 0.631 -
llama-3.3-70b brighter_emotion 0-3 0.769 0.757 -
llama-3.2-3b brighter_emotion 0-3 0.799 0.657 -
gemma-3-27b brighter_emotion 0-3 0.491 0.549 -
mistral-small brighter_emotion 0-3 0.869 0.860 -
gpt-4o-mini brighter_emotion 0-3 0.599 0.583 -
hermes-3-405b brighter_emotion 0-3 0.773 0.776 -

🎯 Strategy Comparison

All strategy+temperature+mode combinations. Click tabs to switch metrics.

Model Dataset Range few_shot@T0.0_C few_shot@T0.0_D few_shot@T0.2_C few_shot@T0.2_D few_shot@T1.0_C few_shot@T1.0_D rag_rubric@T0.0_D rag_rubric@T0.2_D rubric_based@T0.0_C rubric_based@T0.0_D rubric_based@T0.2_C rubric_based@T0.2_D rubric_based@T1.0_C rubric_based@T1.0_D zero_shot@T0.0_C zero_shot@T0.0_D zero_shot@T0.2_C zero_shot@T0.2_D zero_shot@T1.0_C zero_shot@T1.0_D
llama-3.3-70b persuade 1-6 0.300 0.000 0.300 0.000 0.380 0.000 0.400 0.400 0.380 0.400 0.380 0.400 0.380 0.400 0.300 0.000 0.300 0.000 0.300 0.000
llama-3.3-70b turkish_movie_sentiment 0-5 0.186 0.214 0.186 0.186 0.186 0.186 0.200 - 0.190 0.217 0.190 0.250 0.212 0.183 0.186 0.186 0.186 0.157 0.203 0.186
llama-3.3-70b turkish_sentiment_large 0-2 0.193 0.125 0.193 0.125 0.179 0.125 0.000 - 0.186 0.143 0.186 0.143 0.179 0.143 0.175 0.125 0.175 0.125 0.175 0.188
llama-3.1-8b persuade 1-6 0.040 0.000 0.040 0.200 0.340 0.000 - - 0.300 0.400 0.300 0.400 0.300 0.400 0.100 0.200 0.040 0.200 0.300 0.200
llama-3.3-70b turkish_offensive 0-1 0.222 0.222 0.222 0.222 0.222 0.222 0.000 0.000 0.222 0.222 0.222 0.222 0.333 0.333 0.222 0.200 0.333 0.200 0.222 0.200
llama-3.1-8b turkish_sentiment_large 0-2 0.164 0.188 0.179 0.188 0.146 0.188 - - 0.164 0.143 0.168 0.214 0.171 0.214 0.156 0.188 0.162 0.188 0.162 0.188
llama-3.1-8b turkish_movie_sentiment 0-5 0.257 0.186 0.256 0.186 0.097 0.214 - - 0.258 0.183 0.217 0.183 0.215 0.250 0.250 0.186 0.210 0.186 0.186 0.157
llama-3.1-8b turkish_offensive 0-1 0.556 0.556 0.556 0.556 0.333 0.556 - - 0.556 0.556 0.444 0.556 0.444 0.444 0.556 0.500 0.556 0.556 0.333 0.556
llama-3.2-3b persuade 1-6 0.050 0.000 0.050 0.000 0.050 0.000 - - 0.300 0.400 0.300 0.200 0.100 0.000 0.100 0.000 0.100 0.000 0.100 0.200
llama-3.2-3b turkish_sentiment_large 0-2 0.250 0.250 0.286 0.312 0.309 0.375 - - 0.250 0.286 0.250 0.214 0.294 0.357 0.281 0.188 0.250 0.188 0.297 0.250
llama-3.2-3b turkish_movie_sentiment 0-5 0.143 0.129 0.150 0.129 0.250 0.186 - - 0.158 0.150 0.175 0.150 0.224 0.117 0.114 0.129 0.114 0.157 0.207 0.186
llama-3.2-3b turkish_offensive 0-1 0.667 0.667 0.556 0.667 0.556 0.667 - - 0.667 0.667 0.667 0.667 0.500 0.667 0.667 0.556 0.556 0.556 0.667 0.556
gemma-3-27b persuade 1-6 0.300 0.000 0.300 0.000 0.300 0.000 - - 0.380 0.400 0.380 0.400 0.380 0.400 0.300 0.200 0.300 0.200 0.230 0.200
gemma-3-27b turkish_movie_sentiment 0-5 0.195 0.186 0.203 0.186 0.203 0.186 - - 0.195 0.217 0.203 0.217 0.195 0.217 0.174 0.186 0.174 0.157 0.174 0.186
gemma-3-27b turkish_sentiment_large 0-2 0.200 0.125 0.200 0.125 0.204 0.125 - - 0.200 0.143 0.200 0.143 0.200 0.143 0.184 0.125 0.178 0.125 0.188 0.125
mistral-small persuade 1-6 0.300 0.200 0.300 0.200 0.300 0.200 - - 0.350 0.400 0.350 0.400 0.350 0.400 0.300 0.200 0.300 0.200 0.300 0.200
gemma-3-27b turkish_offensive 0-1 0.333 0.333 0.333 0.333 0.333 0.222 - - 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333
mistral-small turkish_movie_sentiment 0-5 0.217 0.186 0.217 0.186 0.217 0.186 - - 0.217 0.183 0.217 0.183 0.217 0.183 0.200 0.186 0.200 0.186 0.200 0.186
mistral-small turkish_sentiment_large 0-2 0.204 0.188 0.204 0.188 0.204 0.188 - - 0.204 0.214 0.204 0.214 0.204 0.214 0.188 0.188 0.188 0.188 0.188 0.188
mistral-small turkish_offensive 0-1 0.333 0.333 0.333 0.333 0.333 0.333 - - 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333
gpt-4o-mini persuade 1-6 0.000 0.000 0.000 0.000 0.000 0.000 - - 0.400 0.400 0.400 0.400 0.300 0.400 0.100 0.200 0.100 0.200 0.200 0.200
gpt-4o-mini turkish_movie_sentiment 0-5 0.180 0.214 0.180 0.214 0.203 0.243 - - 0.203 0.217 0.210 0.217 0.193 0.217 0.166 0.214 0.166 0.214 0.177 0.214
gpt-4o-mini turkish_offensive 0-1 0.333 0.333 0.333 0.333 0.333 0.333 - - 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333
hermes-3-405b persuade 1-6 0.100 0.000 0.100 0.000 0.100 0.000 - - 0.300 0.400 0.300 0.400 0.400 0.400 0.100 0.000 0.100 0.000 0.100 0.000
hermes-3-405b turkish_movie_sentiment 0-5 0.143 0.157 0.143 0.157 0.158 0.157 - - 0.143 0.183 0.143 0.183 0.160 0.183 0.130 0.157 0.130 0.186 0.151 0.186
hermes-3-405b turkish_offensive 0-1 0.222 0.222 0.222 0.222 0.222 0.222 - - 0.222 0.222 0.222 0.222 0.222 0.333 0.222 0.222 0.222 0.222 0.333 0.222
gpt-4o-mini turkish_sentiment_large 0-2 0.179 0.125 0.189 0.125 0.193 0.125 - - 0.189 0.143 0.189 0.143 0.132 0.143 0.178 0.125 0.169 0.125 0.178 0.125
hermes-3-405b turkish_sentiment_large 0-2 0.189 0.188 0.189 0.188 0.225 0.188 - - 0.196 0.214 0.196 0.214 0.207 0.214 0.203 0.188 0.188 0.188 0.216 0.188
llama-3.1-8b brighter_emotion 0-3 0.229 0.208 0.208 0.208 0.202 0.208 - - 0.240 0.208 0.240 0.208 0.194 0.292 0.208 0.208 0.221 0.167 0.246 0.250
llama-3.3-70b brighter_emotion 0-3 0.215 0.167 0.217 0.208 0.215 0.250 0.000 - 0.215 0.250 0.215 0.208 0.242 0.208 0.217 0.208 0.217 0.250 0.217 0.208
llama-3.2-3b brighter_emotion 0-3 0.219 0.167 0.198 0.208 0.177 0.083 - - 0.208 0.208 0.198 0.208 0.292 0.333 0.219 0.167 0.281 0.208 0.188 0.167
gemma-3-27b brighter_emotion 0-3 0.190 0.208 0.196 0.208 0.198 0.208 - - 0.190 0.208 0.179 0.208 0.208 0.167 0.194 0.208 0.202 0.208 0.179 0.208
mistral-small brighter_emotion 0-3 0.171 0.125 0.171 0.125 0.171 0.125 - - 0.171 0.125 0.179 0.125 0.183 0.125 0.171 0.125 0.171 0.125 0.167 0.125
gpt-4o-mini brighter_emotion 0-3 0.196 0.208 0.198 0.208 0.196 0.292 - - 0.186 0.208 0.188 0.250 0.190 0.250 0.188 0.208 0.198 0.208 0.217 0.125
hermes-3-405b brighter_emotion 0-3 0.165 0.125 0.135 0.167 0.169 0.208 - - 0.176 0.125 0.200 0.125 0.224 0.125 0.177 0.125 0.185 0.125 0.163 0.125
Model Dataset Range few_shot@T0.0_C few_shot@T0.0_D few_shot@T0.2_C few_shot@T0.2_D few_shot@T1.0_C few_shot@T1.0_D rag_rubric@T0.0_D rag_rubric@T0.2_D rubric_based@T0.0_C rubric_based@T0.0_D rubric_based@T0.2_C rubric_based@T0.2_D rubric_based@T1.0_C rubric_based@T1.0_D zero_shot@T0.0_C zero_shot@T0.0_D zero_shot@T0.2_C zero_shot@T0.2_D zero_shot@T1.0_C zero_shot@T1.0_D
llama-3.3-70b persuade 1-6 1.500 0.000 1.500 0.000 1.900 0.000 2.000 2.000 1.900 2.000 1.900 2.000 1.900 2.000 1.500 0.000 1.500 0.000 1.500 0.000
llama-3.3-70b turkish_movie_sentiment 0-5 0.929 1.071 0.929 0.929 0.929 0.929 1.000 - 0.950 1.083 0.950 1.250 1.058 0.917 0.929 0.929 0.929 0.786 1.014 0.929
llama-3.3-70b turkish_sentiment_large 0-2 0.386 0.250 0.386 0.250 0.357 0.250 0.000 - 0.371 0.286 0.371 0.286 0.357 0.286 0.350 0.250 0.350 0.250 0.350 0.375
llama-3.1-8b persuade 1-6 0.200 0.000 0.200 1.000 1.700 0.000 - - 1.500 2.000 1.500 2.000 1.500 2.000 0.500 1.000 0.200 1.000 1.500 1.000
llama-3.3-70b turkish_offensive 0-1 0.222 0.222 0.222 0.222 0.222 0.222 0.000 0.000 0.222 0.222 0.222 0.222 0.333 0.333 0.222 0.200 0.333 0.200 0.222 0.200
llama-3.1-8b turkish_sentiment_large 0-2 0.329 0.375 0.357 0.375 0.293 0.375 - - 0.329 0.286 0.336 0.429 0.343 0.429 0.312 0.375 0.325 0.375 0.325 0.375
llama-3.1-8b turkish_movie_sentiment 0-5 1.286 0.929 1.279 0.929 0.483 1.071 - - 1.292 0.917 1.083 0.917 1.075 1.250 1.250 0.929 1.050 0.929 0.931 0.786
llama-3.1-8b turkish_offensive 0-1 0.556 0.556 0.556 0.556 0.333 0.556 - - 0.556 0.556 0.444 0.556 0.444 0.444 0.556 0.500 0.556 0.556 0.333 0.556
llama-3.2-3b persuade 1-6 0.250 0.000 0.250 0.000 0.250 0.000 - - 1.500 2.000 1.500 1.000 0.500 0.000 0.500 0.000 0.500 0.000 0.500 1.000
llama-3.2-3b turkish_sentiment_large 0-2 0.500 0.500 0.571 0.625 0.619 0.750 - - 0.500 0.571 0.500 0.429 0.589 0.714 0.562 0.375 0.500 0.375 0.594 0.500
llama-3.2-3b turkish_movie_sentiment 0-5 0.714 0.643 0.750 0.643 1.250 0.929 - - 0.792 0.750 0.875 0.750 1.120 0.583 0.571 0.643 0.571 0.786 1.036 0.929
llama-3.2-3b turkish_offensive 0-1 0.667 0.667 0.556 0.667 0.556 0.667 - - 0.667 0.667 0.667 0.667 0.500 0.667 0.667 0.556 0.556 0.556 0.667 0.556
gemma-3-27b persuade 1-6 1.500 0.000 1.500 0.000 1.500 0.000 - - 1.900 2.000 1.900 2.000 1.900 2.000 1.500 1.000 1.500 1.000 1.150 1.000
gemma-3-27b turkish_movie_sentiment 0-5 0.975 0.929 1.017 0.929 1.017 0.929 - - 0.975 1.083 1.017 1.083 0.975 1.083 0.871 0.929 0.871 0.786 0.871 0.929
gemma-3-27b turkish_sentiment_large 0-2 0.400 0.250 0.400 0.250 0.407 0.250 - - 0.400 0.286 0.400 0.286 0.400 0.286 0.369 0.250 0.356 0.250 0.375 0.250
mistral-small persuade 1-6 1.500 1.000 1.500 1.000 1.500 1.000 - - 1.750 2.000 1.750 2.000 1.750 2.000 1.500 1.000 1.500 1.000 1.500 1.000
gemma-3-27b turkish_offensive 0-1 0.333 0.333 0.333 0.333 0.333 0.222 - - 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333
mistral-small turkish_movie_sentiment 0-5 1.083 0.929 1.083 0.929 1.083 0.929 - - 1.083 0.917 1.083 0.917 1.083 0.917 1.000 0.929 1.000 0.929 1.000 0.929
mistral-small turkish_sentiment_large 0-2 0.407 0.375 0.407 0.375 0.407 0.375 - - 0.407 0.429 0.407 0.429 0.407 0.429 0.375 0.375 0.375 0.375 0.375 0.375
mistral-small turkish_offensive 0-1 0.333 0.333 0.333 0.333 0.333 0.333 - - 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333
gpt-4o-mini persuade 1-6 0.000 0.000 0.000 0.000 0.000 0.000 - - 2.000 2.000 2.000 2.000 1.500 2.000 0.500 1.000 0.500 1.000 1.000 1.000
gpt-4o-mini turkish_movie_sentiment 0-5 0.900 1.071 0.900 1.071 1.017 1.214 - - 1.017 1.083 1.050 1.083 0.967 1.083 0.829 1.071 0.829 1.071 0.886 1.071
gpt-4o-mini turkish_offensive 0-1 0.333 0.333 0.333 0.333 0.333 0.333 - - 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333 0.333
hermes-3-405b persuade 1-6 0.500 0.000 0.500 0.000 0.500 0.000 - - 1.500 2.000 1.500 2.000 2.000 2.000 0.500 0.000 0.500 0.000 0.500 0.000
hermes-3-405b turkish_movie_sentiment 0-5 0.717 0.786 0.717 0.786 0.792 0.786 - - 0.717 0.917 0.717 0.917 0.800 0.917 0.650 0.786 0.650 0.929 0.757 0.929
hermes-3-405b turkish_offensive 0-1 0.222 0.222 0.222 0.222 0.222 0.222 - - 0.222 0.222 0.222 0.222 0.222 0.333 0.222 0.222 0.222 0.222 0.333 0.222
gpt-4o-mini turkish_sentiment_large 0-2 0.357 0.250 0.379 0.250 0.386 0.250 - - 0.379 0.286 0.379 0.286 0.264 0.286 0.356 0.250 0.337 0.250 0.356 0.250
hermes-3-405b turkish_sentiment_large 0-2 0.379 0.375 0.379 0.375 0.450 0.375 - - 0.393 0.429 0.393 0.429 0.414 0.429 0.406 0.375 0.375 0.375 0.431 0.375
llama-3.1-8b brighter_emotion 0-3 0.688 0.625 0.625 0.625 0.606 0.625 - - 0.719 0.625 0.719 0.625 0.581 0.875 0.625 0.625 0.662 0.500 0.738 0.750
llama-3.3-70b brighter_emotion 0-3 0.644 0.500 0.650 0.625 0.644 0.750 0.000 - 0.644 0.750 0.644 0.625 0.725 0.625 0.650 0.625 0.650 0.750 0.650 0.625
llama-3.2-3b brighter_emotion 0-3 0.656 0.500 0.594 0.625 0.531 0.250 - - 0.625 0.625 0.594 0.625 0.875 1.000 0.656 0.500 0.844 0.625 0.562 0.500
gemma-3-27b brighter_emotion 0-3 0.569 0.625 0.588 0.625 0.594 0.625 - - 0.569 0.625 0.538 0.625 0.625 0.500 0.581 0.625 0.606 0.625 0.538 0.625
mistral-small brighter_emotion 0-3 0.512 0.375 0.512 0.375 0.512 0.375 - - 0.512 0.375 0.536 0.375 0.550 0.375 0.512 0.375 0.512 0.375 0.500 0.375
gpt-4o-mini brighter_emotion 0-3 0.588 0.625 0.594 0.625 0.588 0.875 - - 0.557 0.625 0.564 0.750 0.571 0.750 0.562 0.625 0.594 0.625 0.650 0.375
hermes-3-405b brighter_emotion 0-3 0.494 0.375 0.406 0.500 0.506 0.625 - - 0.529 0.375 0.600 0.375 0.671 0.375 0.531 0.375 0.556 0.375 0.488 0.375
Model Dataset Range few_shot@T0.0_C few_shot@T0.0_D few_shot@T0.2_C few_shot@T0.2_D few_shot@T1.0_C few_shot@T1.0_D rag_rubric@T0.0_D rag_rubric@T0.2_D rubric_based@T0.0_C rubric_based@T0.0_D rubric_based@T0.2_C rubric_based@T0.2_D rubric_based@T1.0_C rubric_based@T1.0_D zero_shot@T0.0_C zero_shot@T0.0_D zero_shot@T0.2_C zero_shot@T0.2_D zero_shot@T1.0_C zero_shot@T1.0_D
llama-3.3-70b persuade 1-6 0.0% 100.0% 0.0% 100.0% 0.0% 100.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 0.0% 100.0% 0.0% 100.0%
llama-3.3-70b turkish_movie_sentiment 0-5 0.0% 0.0% 0.0% 14.3% 0.0% 14.3% 0.0% - 0.0% 0.0% 0.0% 0.0% 0.0% 16.7% 0.0% 14.3% 0.0% 28.6% 0.0% 14.3%
llama-3.3-70b turkish_sentiment_large 0-2 71.4% 75.0% 71.4% 75.0% 71.4% 75.0% 100.0% - 71.4% 71.4% 71.4% 71.4% 71.4% 71.4% 75.0% 75.0% 75.0% 75.0% 75.0% 75.0%
llama-3.1-8b persuade 1-6 100.0% 100.0% 100.0% 0.0% 0.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 0.0% 100.0% 0.0% 0.0% 0.0%
llama-3.3-70b turkish_offensive 0-1 77.8% 77.8% 77.8% 77.8% 77.8% 77.8% 100.0% 100.0% 77.8% 77.8% 77.8% 77.8% 66.7% 66.7% 77.8% 80.0% 66.7% 80.0% 77.8% 80.0%
llama-3.1-8b turkish_sentiment_large 0-2 71.4% 62.5% 71.4% 62.5% 71.4% 62.5% - - 71.4% 71.4% 71.4% 57.1% 71.4% 57.1% 75.0% 62.5% 75.0% 62.5% 75.0% 75.0%
llama-3.1-8b turkish_movie_sentiment 0-5 28.6% 28.6% 28.6% 28.6% 50.0% 14.3% - - 0.0% 33.3% 16.7% 33.3% 0.0% 0.0% 42.9% 28.6% 28.6% 28.6% 0.0% 42.9%
llama-3.1-8b turkish_offensive 0-1 44.4% 44.4% 44.4% 44.4% 66.7% 44.4% - - 44.4% 44.4% 55.6% 44.4% 55.6% 55.6% 44.4% 50.0% 44.4% 44.4% 66.7% 44.4%
llama-3.2-3b persuade 1-6 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 0.0%
llama-3.2-3b turkish_sentiment_large 0-2 71.4% 50.0% 71.4% 37.5% 57.1% 37.5% - - 71.4% 42.9% 71.4% 57.1% 57.1% 28.6% 75.0% 62.5% 75.0% 62.5% 37.5% 50.0%
llama-3.2-3b turkish_movie_sentiment 0-5 28.6% 28.6% 33.3% 28.6% 16.7% 28.6% - - 16.7% 16.7% 16.7% 16.7% 0.0% 33.3% 42.9% 28.6% 42.9% 28.6% 28.6% 14.3%
llama-3.2-3b turkish_offensive 0-1 33.3% 33.3% 44.4% 33.3% 44.4% 33.3% - - 33.3% 33.3% 33.3% 33.3% 50.0% 33.3% 33.3% 44.4% 44.4% 44.4% 33.3% 44.4%
gemma-3-27b persuade 1-6 0.0% 100.0% 0.0% 100.0% 0.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0%
gemma-3-27b turkish_movie_sentiment 0-5 16.7% 28.6% 16.7% 28.6% 16.7% 28.6% - - 16.7% 16.7% 16.7% 16.7% 16.7% 16.7% 28.6% 28.6% 28.6% 28.6% 42.9% 28.6%
gemma-3-27b turkish_sentiment_large 0-2 71.4% 75.0% 71.4% 75.0% 71.4% 75.0% - - 71.4% 71.4% 71.4% 71.4% 71.4% 71.4% 75.0% 75.0% 75.0% 75.0% 75.0% 75.0%
mistral-small persuade 1-6 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0%
gemma-3-27b turkish_offensive 0-1 66.7% 66.7% 66.7% 66.7% 66.7% 77.8% - - 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7%
mistral-small turkish_movie_sentiment 0-5 16.7% 14.3% 16.7% 14.3% 16.7% 14.3% - - 16.7% 16.7% 16.7% 16.7% 16.7% 16.7% 28.6% 14.3% 28.6% 14.3% 28.6% 14.3%
mistral-small turkish_sentiment_large 0-2 71.4% 75.0% 71.4% 75.0% 71.4% 75.0% - - 71.4% 71.4% 71.4% 71.4% 71.4% 71.4% 75.0% 75.0% 75.0% 75.0% 75.0% 75.0%
mistral-small turkish_offensive 0-1 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% - - 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7%
gpt-4o-mini persuade 1-6 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 0.0% 100.0% 0.0% 0.0% 0.0%
gpt-4o-mini turkish_movie_sentiment 0-5 16.7% 14.3% 16.7% 14.3% 16.7% 0.0% - - 16.7% 16.7% 16.7% 16.7% 16.7% 16.7% 28.6% 14.3% 28.6% 14.3% 28.6% 0.0%
gpt-4o-mini turkish_offensive 0-1 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% - - 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7% 66.7%
hermes-3-405b persuade 1-6 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
hermes-3-405b turkish_movie_sentiment 0-5 16.7% 28.6% 16.7% 28.6% 16.7% 28.6% - - 16.7% 16.7% 16.7% 16.7% 16.7% 16.7% 28.6% 28.6% 28.6% 14.3% 28.6% 28.6%
hermes-3-405b turkish_offensive 0-1 77.8% 77.8% 77.8% 77.8% 77.8% 77.8% - - 77.8% 77.8% 77.8% 77.8% 77.8% 66.7% 77.8% 77.8% 77.8% 77.8% 66.7% 77.8%
gpt-4o-mini turkish_sentiment_large 0-2 71.4% 75.0% 71.4% 75.0% 71.4% 75.0% - - 71.4% 71.4% 71.4% 71.4% 71.4% 71.4% 75.0% 75.0% 75.0% 75.0% 75.0% 75.0%
hermes-3-405b turkish_sentiment_large 0-2 71.4% 75.0% 71.4% 75.0% 71.4% 75.0% - - 71.4% 71.4% 71.4% 71.4% 71.4% 71.4% 75.0% 75.0% 75.0% 75.0% 75.0% 75.0%
llama-3.1-8b brighter_emotion 0-3 0.0% 50.0% 25.0% 50.0% 25.0% 37.5% - - 0.0% 50.0% 12.5% 50.0% 37.5% 37.5% 12.5% 50.0% 12.5% 50.0% 37.5% 37.5%
llama-3.3-70b brighter_emotion 0-3 25.0% 50.0% 25.0% 37.5% 25.0% 25.0% 100.0% - 25.0% 25.0% 25.0% 37.5% 12.5% 37.5% 25.0% 37.5% 25.0% 25.0% 25.0% 37.5%
llama-3.2-3b brighter_emotion 0-3 50.0% 50.0% 50.0% 37.5% 62.5% 75.0% - - 50.0% 37.5% 50.0% 37.5% 50.0% 25.0% 50.0% 50.0% 12.5% 37.5% 62.5% 62.5%
gemma-3-27b brighter_emotion 0-3 62.5% 50.0% 62.5% 50.0% 62.5% 50.0% - - 62.5% 50.0% 62.5% 50.0% 62.5% 50.0% 62.5% 50.0% 62.5% 50.0% 62.5% 50.0%
mistral-small brighter_emotion 0-3 50.0% 62.5% 50.0% 62.5% 50.0% 62.5% - - 50.0% 62.5% 42.9% 62.5% 42.9% 62.5% 50.0% 62.5% 50.0% 62.5% 50.0% 62.5%
gpt-4o-mini brighter_emotion 0-3 25.0% 37.5% 25.0% 37.5% 25.0% 12.5% - - 28.6% 37.5% 28.6% 25.0% 28.6% 25.0% 25.0% 37.5% 25.0% 37.5% 25.0% 62.5%
hermes-3-405b brighter_emotion 0-3 50.0% 62.5% 62.5% 50.0% 75.0% 37.5% - - 42.9% 62.5% 28.6% 62.5% 28.6% 62.5% 50.0% 62.5% 37.5% 62.5% 62.5% 62.5%
Model Dataset Range few_shot@T0.0_C few_shot@T0.0_D few_shot@T0.2_C few_shot@T0.2_D few_shot@T1.0_C few_shot@T1.0_D rag_rubric@T0.0_D rag_rubric@T0.2_D rubric_based@T0.0_C rubric_based@T0.0_D rubric_based@T0.2_C rubric_based@T0.2_D rubric_based@T1.0_C rubric_based@T1.0_D zero_shot@T0.0_C zero_shot@T0.0_D zero_shot@T0.2_C zero_shot@T0.2_D zero_shot@T1.0_C zero_shot@T1.0_D
llama-3.3-70b persuade 1-6 0.0% 100.0% 0.0% 100.0% 0.0% 100.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 0.0% 100.0% 0.0% 100.0%
llama-3.3-70b turkish_movie_sentiment 0-5 71.4% 85.7% 71.4% 85.7% 71.4% 85.7% 100.0% - 66.7% 83.3% 66.7% 66.7% 50.0% 83.3% 71.4% 85.7% 71.4% 85.7% 57.1% 85.7%
llama-3.3-70b turkish_sentiment_large 0-2 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 87.5%
llama-3.1-8b persuade 1-6 100.0% 100.0% 100.0% 100.0% 0.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 100.0% 100.0% 100.0% 0.0% 100.0%
llama-3.3-70b turkish_offensive 0-1 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
llama-3.1-8b turkish_sentiment_large 0-2 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 100.0% 100.0% 100.0% 100.0% 85.7% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 87.5%
llama-3.1-8b turkish_movie_sentiment 0-5 57.1% 85.7% 42.9% 85.7% 100.0% 85.7% - - 50.0% 83.3% 50.0% 83.3% 66.7% 83.3% 42.9% 85.7% 57.1% 85.7% 57.1% 85.7%
llama-3.1-8b turkish_offensive 0-1 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
llama-3.2-3b persuade 1-6 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 0.0% 0.0% 0.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
llama-3.2-3b turkish_sentiment_large 0-2 100.0% 100.0% 100.0% 100.0% 100.0% 87.5% - - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
llama-3.2-3b turkish_movie_sentiment 0-5 71.4% 100.0% 66.7% 100.0% 33.3% 71.4% - - 66.7% 100.0% 66.7% 100.0% 50.0% 100.0% 85.7% 100.0% 85.7% 85.7% 57.1% 85.7%
llama-3.2-3b turkish_offensive 0-1 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
gemma-3-27b persuade 1-6 0.0% 100.0% 0.0% 100.0% 0.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 0.0% 100.0% 0.0% 100.0%
gemma-3-27b turkish_movie_sentiment 0-5 66.7% 85.7% 50.0% 85.7% 50.0% 85.7% - - 66.7% 83.3% 50.0% 83.3% 66.7% 83.3% 71.4% 85.7% 71.4% 100.0% 57.1% 85.7%
gemma-3-27b turkish_sentiment_large 0-2 85.7% 100.0% 85.7% 100.0% 85.7% 100.0% - - 85.7% 100.0% 85.7% 100.0% 85.7% 100.0% 87.5% 100.0% 87.5% 100.0% 87.5% 100.0%
mistral-small persuade 1-6 0.0% 100.0% 0.0% 100.0% 0.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 0.0% 100.0% 0.0% 100.0%
gemma-3-27b turkish_offensive 0-1 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
mistral-small turkish_movie_sentiment 0-5 33.3% 85.7% 33.3% 85.7% 33.3% 85.7% - - 33.3% 83.3% 33.3% 83.3% 33.3% 83.3% 42.9% 85.7% 42.9% 85.7% 42.9% 85.7%
mistral-small turkish_sentiment_large 0-2 85.7% 87.5% 85.7% 87.5% 85.7% 87.5% - - 85.7% 85.7% 85.7% 85.7% 85.7% 85.7% 87.5% 87.5% 87.5% 87.5% 87.5% 87.5%
mistral-small turkish_offensive 0-1 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
gpt-4o-mini persuade 1-6 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
gpt-4o-mini turkish_movie_sentiment 0-5 83.3% 85.7% 83.3% 85.7% 66.7% 71.4% - - 66.7% 83.3% 50.0% 83.3% 50.0% 83.3% 85.7% 85.7% 85.7% 85.7% 85.7% 85.7%
gpt-4o-mini turkish_offensive 0-1 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
hermes-3-405b persuade 1-6 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 0.0% 0.0% 0.0% 0.0% 0.0% 0.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
hermes-3-405b turkish_movie_sentiment 0-5 83.3% 100.0% 83.3% 100.0% 83.3% 100.0% - - 83.3% 100.0% 83.3% 100.0% 83.3% 100.0% 85.7% 100.0% 85.7% 100.0% 71.4% 85.7%
hermes-3-405b turkish_offensive 0-1 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
gpt-4o-mini turkish_sentiment_large 0-2 100.0% 100.0% 100.0% 100.0% 85.7% 100.0% - - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
hermes-3-405b turkish_sentiment_large 0-2 85.7% 87.5% 85.7% 87.5% 85.7% 87.5% - - 85.7% 85.7% 85.7% 85.7% 85.7% 85.7% 87.5% 87.5% 87.5% 87.5% 87.5% 87.5%
llama-3.1-8b brighter_emotion 0-3 87.5% 87.5% 87.5% 87.5% 87.5% 100.0% - - 87.5% 87.5% 75.0% 87.5% 75.0% 87.5% 87.5% 87.5% 87.5% 100.0% 62.5% 87.5%
llama-3.3-70b brighter_emotion 0-3 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
llama-3.2-3b brighter_emotion 0-3 87.5% 100.0% 87.5% 100.0% 87.5% 100.0% - - 87.5% 100.0% 87.5% 100.0% 62.5% 75.0% 87.5% 100.0% 75.0% 100.0% 75.0% 87.5%
gemma-3-27b brighter_emotion 0-3 87.5% 87.5% 87.5% 87.5% 75.0% 87.5% - - 87.5% 87.5% 87.5% 87.5% 75.0% 100.0% 87.5% 87.5% 75.0% 87.5% 75.0% 87.5%
mistral-small brighter_emotion 0-3 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% - - 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0% 100.0%
gpt-4o-mini brighter_emotion 0-3 87.5% 100.0% 87.5% 100.0% 87.5% 100.0% - - 85.7% 100.0% 85.7% 100.0% 85.7% 100.0% 87.5% 100.0% 87.5% 100.0% 87.5% 100.0%
hermes-3-405b brighter_emotion 0-3 87.5% 100.0% 87.5% 100.0% 87.5% 100.0% - - 85.7% 100.0% 85.7% 100.0% 71.4% 100.0% 87.5% 100.0% 87.5% 100.0% 75.0% 100.0%
Model Dataset Range few_shot@T0.0_C few_shot@T0.0_D few_shot@T0.2_C few_shot@T0.2_D few_shot@T1.0_C few_shot@T1.0_D rag_rubric@T0.0_D rag_rubric@T0.2_D rubric_based@T0.0_C rubric_based@T0.0_D rubric_based@T0.2_C rubric_based@T0.2_D rubric_based@T1.0_C rubric_based@T1.0_D zero_shot@T0.0_C zero_shot@T0.0_D zero_shot@T0.2_C zero_shot@T0.2_D zero_shot@T1.0_C zero_shot@T1.0_D
llama-3.3-70b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
llama-3.3-70b turkish_movie_sentiment 0-5 0.639 0.639 0.639 0.693 0.639 0.693 - - 0.649 0.649 0.649 0.561 0.649 0.707 0.639 0.693 0.639 0.741 0.639 0.693
llama-3.3-70b turkish_sentiment_large 0-2 0.650 0.667 0.650 0.667 0.650 0.667 - - 0.650 0.650 0.650 0.650 0.650 0.650 0.667 0.667 0.667 0.667 0.667 0.412
llama-3.1-8b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
llama-3.3-70b turkish_offensive 0-1 0.571 0.571 0.571 0.571 0.571 0.571 - - 0.571 0.571 0.571 0.571 0.341 0.341 0.571 0.615 0.341 0.615 0.571 0.615
llama-3.1-8b turkish_sentiment_large 0-2 0.650 0.455 0.650 0.647 0.650 0.455 - - 0.650 0.650 0.650 0.632 0.650 0.432 0.667 0.455 0.667 0.455 0.667 0.412
llama-3.1-8b turkish_movie_sentiment 0-5 -0.140 0.305 -0.140 0.305 0.847 0.183 - - -0.320 0.308 0.170 0.308 0.000 0.087 -0.186 0.305 0.000 0.305 0.639 0.300
llama-3.1-8b turkish_offensive 0-1 -0.154 -0.154 -0.154 -0.154 0.341 -0.154 - - -0.154 -0.154 0.100 -0.154 0.100 0.100 -0.154 -0.087 -0.154 -0.154 0.341 -0.154
llama-3.2-3b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
llama-3.2-3b turkish_sentiment_large 0-2 0.000 0.429 0.588 0.333 0.512 0.200 - - 0.000 0.391 0.000 0.512 0.432 0.444 0.600 0.538 0.000 0.538 -0.429 0.333
llama-3.2-3b turkish_movie_sentiment 0-5 0.263 0.653 0.700 0.653 0.057 0.330 - - 0.250 0.651 0.250 0.651 0.053 0.760 0.708 0.653 0.708 0.500 0.423 0.323
llama-3.2-3b turkish_offensive 0-1 -0.421 -0.421 -0.154 -0.421 -0.154 -0.421 - - -0.421 -0.421 -0.421 -0.421 -0.067 -0.421 -0.421 -0.154 -0.154 -0.154 -0.421 -0.154
gemma-3-27b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
gemma-3-27b turkish_movie_sentiment 0-5 0.717 0.569 0.717 0.569 0.717 0.569 - - 0.717 0.556 0.717 0.556 0.717 0.556 0.724 0.569 0.724 0.724 0.708 0.569
gemma-3-27b turkish_sentiment_large 0-2 0.650 0.667 0.650 0.667 0.650 0.667 - - 0.650 0.650 0.650 0.650 0.650 0.650 0.667 0.667 0.667 0.667 0.667 0.667
mistral-small persuade 1-6 - - - - - - - - - - - - - - - - - - - -
gemma-3-27b turkish_offensive 0-1 0.341 0.341 0.341 0.341 0.341 0.571 - - 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341
mistral-small turkish_movie_sentiment 0-5 0.700 0.594 0.700 0.594 0.700 0.594 - - 0.700 0.600 0.700 0.600 0.700 0.600 0.708 0.594 0.708 0.594 0.708 0.594
mistral-small turkish_sentiment_large 0-2 0.650 0.412 0.650 0.412 0.650 0.412 - - 0.650 0.386 0.650 0.386 0.650 0.386 0.667 0.412 0.667 0.412 0.667 0.412
mistral-small turkish_offensive 0-1 0.341 0.341 0.341 0.341 0.341 0.341 - - 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341
gpt-4o-mini persuade 1-6 - - - - - - - - - - - - - - - - - - - -
gpt-4o-mini turkish_movie_sentiment 0-5 0.717 0.656 0.717 0.656 0.667 0.596 - - 0.667 0.667 0.667 0.667 0.667 0.667 0.674 0.656 0.674 0.656 0.674 0.639
gpt-4o-mini turkish_offensive 0-1 0.341 0.341 0.341 0.341 0.341 0.341 - - 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341 0.341
hermes-3-405b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
hermes-3-405b turkish_movie_sentiment 0-5 0.717 0.724 0.717 0.724 0.717 0.724 - - 0.717 0.717 0.717 0.717 0.717 0.717 0.724 0.724 0.724 0.696 0.724 0.674
hermes-3-405b turkish_offensive 0-1 0.571 0.571 0.571 0.571 0.571 0.571 - - 0.571 0.571 0.571 0.571 0.571 0.341 0.571 0.571 0.571 0.571 0.341 0.571
gpt-4o-mini turkish_sentiment_large 0-2 0.650 0.667 0.650 0.667 0.650 0.667 - - 0.650 0.650 0.650 0.650 0.650 0.650 0.667 0.667 0.667 0.667 0.667 0.667
hermes-3-405b turkish_sentiment_large 0-2 0.650 0.412 0.650 0.412 0.386 0.412 - - 0.650 0.386 0.650 0.386 0.386 0.386 0.412 0.412 0.667 0.412 0.412 0.412
llama-3.1-8b brighter_emotion 0-3 0.458 0.600 0.510 0.600 0.510 0.688 - - 0.458 0.600 0.576 0.600 0.688 0.297 0.533 0.600 0.576 0.775 0.347 0.492
llama-3.3-70b brighter_emotion 0-3 0.680 0.775 0.680 0.722 0.680 0.631 - - 0.680 0.631 0.680 0.722 0.576 0.688 0.680 0.722 0.680 0.631 0.680 0.688
llama-3.2-3b brighter_emotion 0-3 0.667 0.792 0.702 0.744 0.718 0.869 - - 0.792 0.773 0.702 0.744 0.216 0.360 0.702 0.816 0.381 0.773 0.642 0.688
gemma-3-27b brighter_emotion 0-3 0.467 0.391 0.467 0.391 0.467 0.391 - - 0.467 0.391 0.467 0.391 0.467 0.660 0.467 0.391 0.467 0.391 0.739 0.391
mistral-small brighter_emotion 0-3 0.719 0.854 0.719 0.854 0.719 0.854 - - 0.719 0.854 0.708 0.854 0.708 0.854 0.719 0.854 0.719 0.854 0.719 0.854
gpt-4o-mini brighter_emotion 0-3 0.529 0.688 0.529 0.688 0.529 0.440 - - 0.578 0.688 0.578 0.579 0.578 0.579 0.529 0.688 0.529 0.688 0.529 0.778
hermes-3-405b brighter_emotion 0-3 0.719 0.806 0.786 0.746 0.873 0.677 - - 0.708 0.806 0.632 0.806 0.632 0.806 0.719 0.806 0.643 0.806 0.806 0.806
Model Dataset Range few_shot@T0.0_C few_shot@T0.0_D few_shot@T0.2_C few_shot@T0.2_D few_shot@T1.0_C few_shot@T1.0_D rag_rubric@T0.0_D rag_rubric@T0.2_D rubric_based@T0.0_C rubric_based@T0.0_D rubric_based@T0.2_C rubric_based@T0.2_D rubric_based@T1.0_C rubric_based@T1.0_D zero_shot@T0.0_C zero_shot@T0.0_D zero_shot@T0.2_C zero_shot@T0.2_D zero_shot@T1.0_C zero_shot@T1.0_D
llama-3.3-70b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
llama-3.3-70b turkish_movie_sentiment 0-5 0.553 0.553 0.553 0.705 0.553 0.705 - - 0.657 0.657 0.657 0.409 0.716 0.773 0.553 0.705 0.553 0.903 0.624 0.705
llama-3.3-70b turkish_sentiment_large 0-2 0.746 0.800 0.746 0.800 0.725 0.800 - - 0.746 0.794 0.731 0.794 0.725 0.794 0.719 0.800 0.743 0.800 0.743 0.655
llama-3.1-8b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
llama-3.3-70b turkish_offensive 0-1 0.632 0.632 0.632 0.632 0.632 0.632 - - 0.632 0.632 0.632 0.632 0.350 0.350 0.632 0.667 0.350 0.667 0.632 0.667
llama-3.1-8b turkish_sentiment_large 0-2 0.798 0.412 0.798 0.873 0.820 0.412 - - 0.798 0.794 0.791 0.882 0.638 0.394 0.775 0.412 0.756 0.412 0.756 0.655
llama-3.1-8b turkish_movie_sentiment 0-5 0.000 0.476 -0.101 0.476 0.971 0.152 - - -0.273 0.418 0.000 0.418 -0.090 0.091 0.009 0.476 0.093 0.476 0.546 0.456
llama-3.1-8b turkish_offensive 0-1 -0.158 -0.158 -0.158 -0.158 0.350 -0.158 - - -0.158 -0.158 0.100 -0.158 0.100 0.100 -0.158 -0.089 -0.158 -0.158 0.350 -0.158
llama-3.2-3b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
llama-3.2-3b turkish_sentiment_large 0-2 0.000 0.621 0.645 0.582 0.580 0.408 - - 0.000 0.617 0.000 0.683 0.394 0.837 0.655 0.690 0.000 0.690 -0.270 0.291
llama-3.2-3b turkish_movie_sentiment 0-5 0.680 0.734 0.746 0.734 -0.136 0.353 - - 0.469 0.682 0.469 0.682 0.179 0.739 0.748 0.734 0.725 0.505 0.339 0.353
llama-3.2-3b turkish_offensive 0-1 -0.478 -0.478 -0.158 -0.478 -0.158 -0.478 - - -0.478 -0.478 -0.478 -0.478 -0.067 -0.478 -0.478 -0.158 -0.158 -0.158 -0.478 -0.158
gemma-3-27b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
gemma-3-27b turkish_movie_sentiment 0-5 0.441 0.502 0.403 0.502 0.403 0.502 - - 0.441 0.563 0.403 0.563 0.441 0.563 0.385 0.502 0.385 0.748 0.321 0.502
gemma-3-27b turkish_sentiment_large 0-2 0.638 0.800 0.638 0.800 0.638 0.800 - - 0.638 0.794 0.638 0.794 0.638 0.794 0.645 0.800 0.645 0.800 0.645 0.800
mistral-small persuade 1-6 - - - - - - - - - - - - - - - - - - - -
gemma-3-27b turkish_offensive 0-1 0.350 0.350 0.350 0.350 0.350 0.632 - - 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350
mistral-small turkish_movie_sentiment 0-5 0.806 0.611 0.806 0.611 0.806 0.611 - - 0.806 0.716 0.806 0.716 0.806 0.716 0.685 0.611 0.685 0.611 0.685 0.611
mistral-small turkish_sentiment_large 0-2 0.656 0.655 0.656 0.655 0.656 0.655 - - 0.656 0.617 0.656 0.617 0.656 0.617 0.650 0.655 0.650 0.655 0.650 0.655
mistral-small turkish_offensive 0-1 0.350 0.350 0.350 0.350 0.350 0.350 - - 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350
gpt-4o-mini persuade 1-6 - - - - - - - - - - - - - - - - - - - -
gpt-4o-mini turkish_movie_sentiment 0-5 0.773 0.572 0.773 0.572 0.563 0.553 - - 0.773 0.783 0.563 0.783 0.672 0.783 0.705 0.572 0.705 0.572 0.705 0.553
gpt-4o-mini turkish_offensive 0-1 0.350 0.350 0.350 0.350 0.350 0.350 - - 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350 0.350
hermes-3-405b persuade 1-6 - - - - - - - - - - - - - - - - - - - -
hermes-3-405b turkish_movie_sentiment 0-5 0.773 0.748 0.773 0.748 0.806 0.748 - - 0.773 0.783 0.773 0.783 0.773 0.783 0.705 0.748 0.705 0.572 0.587 0.748
hermes-3-405b turkish_offensive 0-1 0.632 0.632 0.632 0.632 0.632 0.632 - - 0.632 0.632 0.632 0.632 0.632 0.350 0.632 0.632 0.632 0.632 0.350 0.632
gpt-4o-mini turkish_sentiment_large 0-2 0.746 0.800 0.746 0.800 0.638 0.800 - - 0.746 0.794 0.746 0.794 0.798 0.794 0.743 0.800 0.743 0.800 0.743 0.800
hermes-3-405b turkish_sentiment_large 0-2 0.656 0.655 0.656 0.655 0.644 0.655 - - 0.644 0.617 0.644 0.617 0.580 0.617 0.588 0.655 0.650 0.655 0.650 0.655
llama-3.1-8b brighter_emotion 0-3 0.654 0.721 0.732 0.721 0.833 0.749 - - 0.650 0.721 0.650 0.721 0.865 0.413 0.721 0.721 0.581 0.833 0.584 0.556
llama-3.3-70b brighter_emotion 0-3 0.815 0.833 0.815 0.734 0.815 0.660 - - 0.815 0.660 0.815 0.734 0.750 0.749 0.815 0.734 0.815 0.660 0.815 0.749
llama-3.2-3b brighter_emotion 0-3 0.726 0.876 0.790 0.795 0.790 0.901 - - 0.900 0.844 0.790 0.795 0.089 0.497 0.779 0.883 0.608 0.844 0.575 0.680
gemma-3-27b brighter_emotion 0-3 0.650 0.392 0.646 0.392 0.525 0.392 - - 0.650 0.392 0.730 0.392 0.522 0.683 0.503 0.392 0.596 0.392 0.698 0.392
mistral-small brighter_emotion 0-3 0.833 0.917 0.833 0.917 0.833 0.917 - - 0.833 0.917 0.819 0.917 0.774 0.917 0.833 0.917 0.833 0.917 0.869 0.917
gpt-4o-mini brighter_emotion 0-3 0.728 0.636 0.636 0.636 0.728 0.528 - - 0.731 0.636 0.686 0.576 0.591 0.576 0.716 0.636 0.636 0.636 0.546 0.761
hermes-3-405b brighter_emotion 0-3 0.772 0.859 0.895 0.737 0.895 0.699 - - 0.680 0.859 0.661 0.859 0.548 0.859 0.869 0.859 0.725 0.859 0.823 0.859

📚 Per-Question Model Comparison

# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 3.04.05.04.04.03.03.0 The statement is simple and clear, but l...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 3.04.05.05.04.03.03.0 The statement is clear and grammatically...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.02.02.02.02.0 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.04.04.04.04.03.04.0 Metin genel olarak olumlu bir duygu ifad...
3 Sağlam bir uyarlama...... 5.0 3.04.05.03.05.03.04.0 Metin, "Sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 4.04.05.04.04.04.04.0 Metindeki genel duygu durumu olumluya ya...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.04.05.05.05.05.05.0 Metin genel olarak pozitif bir duygu dur...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.00.05.03.00.01.00.0 Metin genel olarak olumsuz bir duygu ifa...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 4.04.04.04.05.05.05.0 Metin genel olarak olumlu bir duygu duru...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.02.02.02.02.0 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.04.04.04.04.03.04.0 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 3.04.05.05.05.03.04.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 4.04.05.04.04.04.04.0 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.04.05.05.05.05.04.0 Metin genel olarak pozitif bir duygu dur...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.01.05.03.00.01.00.0 Metin oldukça eleştirel bir duygu barınd...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 4.04.05.04.05.05.05.0 Metin genel olarak olumlu bir duygu duru...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.02.01.01.01.01.0 Metin nötr bir bilgi cümlesidir. Bir bek...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 1.00.01.01.01.00.01.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 2.02.02.01.02.02.02.0 Metin genel olarak olumlu bir duygu ifad...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.0 Metin, "sert" ve "acıması...
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu barındırı...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.02.01.01.01.0 Metin nötr bir ifade içeriyor. Bir durum...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 1.00.00.01.00.00.01.0 Metinde belirgin bir mutluluk veya heyec...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 2.02.02.01.02.02.02.0 Metin genel olarak olumlu bir duygu içer...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.0 Metin, "sert" ve "acıması...
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade etm...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 3.04.05.04.04.03.03.0 A very basic, common statement. Grammati...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu durumu ya...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.02.01.01.01.01.0 Metin nötr bir bilgi cümlesidir. Bir bek...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 1.00.01.01.01.00.01.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 2.02.02.01.02.02.02.0 Metin genel olarak olumlu bir duygu ifad...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.0 Metin, "sert" ve "acıması...
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, ...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
10 @smoketilimhigh3 @highenoughtodi Böyle ironiyi de sikiyim am... 1.0 ----1.0-- PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin d...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metinde...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
10 @smoketilimhigh3 @highenoughtodi Böyle ironiyi de sikiyim am... 1.0 --1.0-1.0-- PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.02.02.02.02.0 Metinde hem olumlu hem de olumsuz duygul...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.04.04.05.04.03.04.0 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 4.04.05.05.05.03.04.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 4.04.05.04.05.04.04.0 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.04.05.05.05.05.04.0 Metin genel olarak pozitif bir duygu dur...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.01.05.03.00.01.00.0 Metin oldukça eleştirel bir duygu barınd...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 4.05.05.04.05.05.05.0 Metin genel olarak olumlu bir duygu duru...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, ...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Emo...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metinde...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin d...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metinde ...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin ...
10 @smoketilimhigh3 @highenoughtodi Böyle ironiyi de sikiyim am... 1.0 ----1.0-- PUAN: 1 SINIF: Offensive AÇIKLAMA: Metin...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.54.54.54.52.52.53.5 The statement is simple and clear, but l...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.54.54.24.52.52.53.5 The statement is simple and clear, but l...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.94.52.54.52.52.53.0 The statement is simple, clear, and expr...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.51.52.52.21.52.0 Metinde karma bir duygu durumu var. Oyun...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.03.54.24.54.22.54.0 Metin genel olarak olumlu bir duygu ifad...
3 Sağlam bir uyarlama...... 5.0 3.53.52.54.54.23.54.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 3.64.24.84.54.23.54.2 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.24.24.54.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.21.24.81.50.20.50.5 Metin genel olarak olumsuz bir duygu taş...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 4.24.24.54.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin di...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, ...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin ...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.01.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin di...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin ...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.52.52.52.21.52.0 Metinde hem olumlu hem de olumsuz duygul...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.03.54.24.54.22.54.0 Metin genel olarak olumlu bir duygu ifad...
3 Sağlam bir uyarlama...... 5.0 3.53.53.04.54.23.54.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 3.64.24.84.54.23.54.2 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.24.24.84.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.21.24.82.50.20.50.5 Metin genel olarak olumsuz bir duygu ifa...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 4.24.24.24.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.52.00.31.51.52.0 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.23.54.24.84.22.54.0 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 3.53.53.54.54.23.54.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 3.63.84.53.84.23.54.2 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.24.24.74.84.84.54.2 Metin genel olarak oldukça olumlu bir du...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.51.20.51.80.20.50.1 Metin genel olarak olumsuz bir duygu dur...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 4.24.54.74.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 4.04.04.04.04.03.04.0 Clear and concise statement of opinion. ...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 4.04.05.04.04.03.04.0 Clear and concise statement with a commo...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 1.91.81.91.51.91.81.8 Metin genel olarak oldukça olumlu bir du...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 1.91.92.01.51.91.91.9 Metin oldukça pozitif bir duygu durumu i...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.91.91.81.51.81.81.8 Metin genel olarak olumlu bir duygu duru...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.51.21.01.0 Metin nötr bir bilgi cümlesidir. Bir keş...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 0.70.41.21.51.20.81.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 1.81.81.81.51.81.81.8 Metin genel olarak olumlu bir duygu içer...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.40.00.00.00.30.1 Metin, "sert" ve "acıması...
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 1.91.81.81.51.81.91.8 Metin oldukça olumlu bir duygu ifade edi...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 4.04.04.04.04.03.04.0 Clear, concise statement of opinion. Gra...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 1.91.81.91.41.81.81.9 Metin genel olarak çok olumlu bir duygu ...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 1.91.91.91.52.01.91.8 Metin oldukça pozitif bir duygu durumu i...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.91.91.81.51.81.81.8 Metin genel olarak olumlu bir duygu duru...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.51.21.01.0 Metin nötr ve bilgilendirici bir ifade i...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 0.70.41.21.01.20.81.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 1.81.81.61.91.81.81.8 Metin genel olarak olumlu bir duygu içer...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.20.01.50.00.30.1 Metin tamamen olumsuz bir duygu ifade ed...
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 1.91.82.01.01.81.91.8 Metinde "harika" kelimesinin k...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 1.91.81.91.51.81.81.8 Metin genel olarak oldukça olumlu bir du...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 1.91.92.01.52.01.92.0 Metin oldukça pozitif bir duygu durumu i...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.91.91.51.51.81.81.8 Metin genel olarak olumlu bir duygu duru...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.51.21.01.0 Metin nötr bir bilgi cümlesidir. Bir keş...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 0.70.71.41.51.20.81.0 Metin genel olarak nötr bir duygu durumu...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 1.81.81.81.51.81.81.8 Metin genel olarak olumlu bir duygu duru...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.20.40.01.50.00.30.1 Metin, "sert" ve "acıması...
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 1.91.81.91.51.81.91.8 Metin oldukça olumlu bir duygu ifade edi...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.54.54.24.22.52.54.0 Very simple, common opinion stated plain...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.54.54.24.22.52.54.0 Very simple statement of opinion. Gramma...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.54.52.34.22.12.54.0 Very basic statement; clear but lacks de...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, ...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, k...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, H...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, t...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, ...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 0.00.01.01.00.01.01.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin ...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin d...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metinde ...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, h...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.02.02.02.02.0 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.04.04.04.04.03.04.0 Metin genel olarak olumlu bir duygu barı...
3 Sağlam bir uyarlama...... 5.0 3.04.05.05.05.03.04.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 4.04.05.04.04.04.04.0 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.04.05.05.05.05.04.0 Metin genel olarak pozitif bir duygu dur...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.01.05.03.00.01.00.0 Metin oldukça eleştirel bir duygu barınd...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 4.04.05.04.05.05.05.0 Metin genel olarak olumlu bir duygu duru...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.02.02.02.01.0 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.04.04.05.04.03.04.0 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 3.04.05.05.05.03.04.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 4.04.04.03.04.04.04.0 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.04.05.05.05.05.05.0 Metin genel olarak pozitif bir duygu dur...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.01.05.03.00.01.00.0 Metin oldukça eleştirel bir duygu içeriy...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 4.04.05.04.05.05.05.0 Metin genel olarak olumlu bir duygu duru...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.02.02.02.02.0 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.04.04.04.04.03.04.0 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 3.04.05.05.04.03.04.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 4.04.05.04.04.04.04.0 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.04.05.05.05.05.04.0 Metin genel olarak pozitif bir duygu dur...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.01.05.03.00.01.00.0 Metin oldukça eleştirel bir duygu barınd...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 4.04.05.04.05.05.05.0 Metin genel olarak olumlu bir duygu duru...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 2.02.02.01.02.02.02.0 Metin tamamen olumlu bir duygu durumu ya...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 2.02.02.02.02.02.02.0 Metin oldukça olumlu bir duygu ifade edi...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.02.01.01.01.01.0 Metin nötr bir bilgi cümlesidir. Bir bek...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 1.00.01.01.01.00.01.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 2.02.02.01.02.02.02.0 Metin genel olarak olumlu bir duygu ifad...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.0 Metin, "sert" ve "acıması...
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 2.02.02.01.02.02.02.0 Metin tamamen olumlu bir duygu durumu ya...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade etm...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 2.02.02.01.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.00.01.01.01.01.0 Metin nötr bir bilgi cümlesidir. Bir bek...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 1.00.01.01.01.00.01.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 2.02.02.01.02.02.02.0 Metin genel olarak olumlu bir duygu duru...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.0 Metin, "sert" ve "acıması...
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 2.02.02.01.02.02.02.0 Metin tamamen olumlu bir duygu belirtmek...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu içeriyor....
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 2.02.02.01.02.02.02.0 Metin, "hızlı kargo" ve "...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.02.01.01.01.01.0 Metin nötr bir bilgiyi ifade ediyor. &qu...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 1.00.01.00.01.00.01.0 Metinde genel bir memnuniyetsizlik durum...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 2.02.02.01.02.02.02.0 Metin genel olarak olumlu bir duygu içer...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.0 Metinde "sert" ve "acımas...
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, d...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.02.02.02.02.02.02.0 The text presents a simple statement but...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.02.02.03.02.02.02.0 The text presents a simple statement but...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, H...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, ...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.02.02.04.02.02.02.0 The text presents a simple statement but...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metinde...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin ...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.51.81.52.21.51.5 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.23.54.24.54.22.54.0 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 3.53.52.54.54.23.54.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 3.64.24.84.54.23.54.2 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.24.24.84.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.21.25.01.50.20.51.2 Metin oldukça eleştirel bir duygu barınd...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 --4.5-4.8-- Pozitif/Olumlu duygu, çünkü metinde &quo...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.52.02.52.21.51.5 Metin genel olarak karmaşık bir duygu du...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.03.54.24.54.22.54.0 Metin genel olarak olumlu bir duygu ifad...
3 Sağlam bir uyarlama...... 5.0 3.53.52.04.54.23.54.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 3.64.24.84.54.23.54.2 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.24.24.84.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.21.24.83.50.20.51.2 Metin genel olarak olumsuz bir duygu ifa...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 --4.54.54.8-- Pozitif/Olumlu duygu, çünkü metinde &quo...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.52.53.22.21.52.0 Metinde karışık duygular var. "Oyun...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.23.53.44.54.22.54.2 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 3.53.54.53.04.23.54.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 3.64.24.84.54.23.54.2 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.23.84.02.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.21.21.23.20.20.50.2 Metin oldukça eleştirel bir duygu taşıyo...
7 güzel aşk filmlerinden biri.. unutulamayan aşklara çizik çek... 4.0 ----4.8-- Metinde kullanılan "güzel", &q...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 1.91.91.91.51.81.81.8 Metin genel olarak oldukça olumlu bir du...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 1.91.91.91.51.91.91.9 Metin oldukça olumlu bir duygu durumu ya...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.91.91.91.51.81.81.8 Metin genel olarak olumlu bir duygu ifad...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.51.21.01.0 Metin nötr bir ifade taşıyor. Bir bilgi ...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 0.70.71.11.51.20.81.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 1.81.81.81.51.81.81.8 Metin genel olarak olumlu bir duygu içer...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.40.00.00.00.30.1 Metin açıkça olumsuz bir duygu ifade edi...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 1.91.91.91.51.81.81.8 Metin genel olarak oldukça olumlu bir du...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 1.91.92.01.51.91.92.0 Metin oldukça olumlu bir duygu durumu if...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.91.91.81.51.81.81.8 Metin genel olarak olumlu bir duygu ifad...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.51.21.01.0 Metin nötr bir ifade içeriyor. Bir bilgi...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 0.70.71.21.51.20.81.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 1.81.81.81.51.81.81.8 Metin genel olarak olumlu bir duygu içer...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.40.01.50.00.30.1 Metin, "sert" ve "acıması...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 1.91.82.01.51.91.81.8 Metin genel olarak çok olumlu bir duygu ...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 1.91.92.01.52.01.91.9 Metin oldukça pozitif bir duygu ifade ed...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.91.92.01.21.81.81.8 Metin büyük ölçüde olumlu bir duygu ifad...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.00.01.21.01.0 Metin nötr bir bilgiyi ifade ediyor. Bir...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 0.70.41.21.21.20.80.8 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 1.81.81.81.51.81.81.8 Metin genel olarak olumlu bir duygu içer...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.20.01.20.00.30.3 Metin genel olarak negatif bir duygu dur...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.12.52.52.52.12.22.0 The text presents a simple statement of ...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.12.52.52.52.12.22.0 The text presents a simple statement of ...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 2.12.02.54.52.12.22.5 Extremely limited content, lacking depth...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, ...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin di...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, ...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metinde...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin ...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 ----2.0-- {"score": 2, "explanation...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 I believe that technology has improved our lives significant... 4.0 ----2.0-- {"score": 2, "explanation...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, ...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin, h...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Meti...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metinde ...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.00.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.01.01.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
2 Bugün yine floodluk bissürü çorbalar çıktı da yazacak vaktim... 0.0 1.00.01.01.00.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek nesillerin ... 0.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metinde ...
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu cocuklar ort... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Metin...
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ya da ATAT... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: Met...
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözünle Kainatı... 1.0 0.00.00.00.00.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metin,...
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlarin nispet o... 0.0 0.00.01.01.01.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metinde ...
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.0 PUAN: 1 SINIF: offensive AÇIKLAMA: &qu...
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KASKI TAKIP ... 0.0 0.00.00.0-0.00.00.0 PUAN: 0 SINIF: normal AÇIKLAMA: Metind...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade etm...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.00.01.01.01.01.0 Metin nötr bir bilgi cümlesi içeriyor. B...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 1.00.01.01.01.00.01.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 2.02.02.01.02.02.02.0 Metin genel olarak olumlu bir duygu duru...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.0 Metin, "sert" ve "acıması...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.02.02.02.02.0 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.04.04.04.04.03.04.0 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 3.04.05.05.04.03.04.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 4.04.05.04.04.04.04.0 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.04.05.05.05.05.04.0 Metin genel olarak pozitif bir duygu dur...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.01.05.03.00.01.00.0 Metin oldukça eleştirel bir duygu barınd...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.00.00.00.00.00.00.0 The text evokes a sense of isolation or ...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.02.02.02.01.01.01.0 The text evokes a mild sense of apprehen...
3 It was one of my most shameful experiences.... 3.0 2.03.03.03.03.03.03.0 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 2.01.02.00.01.00.01.0 The text expresses a clear sense of mild...
5 Then the screaming started.... 3.0 3.03.03.03.03.03.03.0 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.01.00.00.00.00.01.0 The text evokes a sense of peaceful acce...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.02.02.02.02.01.02.0 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 2.02.02.03.02.02.01.0 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 2.02.02.01.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.01.01.01.0 Metin nötr bir bilgi cümlesidir. Bir bek...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 1.00.01.01.01.00.01.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 2.02.02.01.02.02.02.0 Metin genel olarak olumlu bir duygu ifad...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.0 Metin, "sert" ve "acıması...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.00.00.00.00.00.00.0 The text evokes a sense of isolation or ...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.02.02.01.01.01.01.0 The text conveys a mild sense of apprehe...
3 It was one of my most shameful experiences.... 3.0 2.03.03.03.02.03.03.0 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 2.01.01.00.01.00.01.0 The text expresses a clear sense of exas...
5 Then the screaming started.... 3.0 3.03.03.03.03.03.03.0 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.01.00.00.00.00.01.0 The text evokes a sense of peaceful acce...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.02.02.02.02.01.02.0 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 2.02.02.03.02.02.01.0 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.02.02.02.02.0 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.04.04.04.05.03.04.0 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 3.04.05.05.04.03.04.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 4.04.05.04.04.04.04.0 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.04.05.05.05.05.04.0 Metin genel olarak pozitif bir duygu dur...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.01.05.03.00.01.00.0 Metin oldukça eleştirel bir duygu barınd...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.00.00.00.00.00.01.0 The text evokes a sense of isolation or ...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.02.02.02.02.01.02.0 The text conveys a mild sense of apprehe...
3 It was one of my most shameful experiences.... 3.0 2.03.02.03.02.03.03.0 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 2.01.02.00.01.00.01.0 The text expresses a clear sense of rese...
5 Then the screaming started.... 3.0 3.03.03.03.03.03.03.0 The word "screaming" immediate...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.01.00.02.00.00.01.0 The text evokes a sense of contemplative...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.02.02.03.02.01.02.0 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 2.02.02.02.02.02.01.0 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 2.02.02.01.02.02.02.0 Metin son derece olumlu bir duygu durumu...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 2.02.02.02.02.02.02.0 Metin tamamen pozitif bir duygu ifade ed...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 2.02.02.02.02.02.02.0 Metin tamamen olumlu bir duygu ifade edi...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.02.00.01.01.01.0 Metin nötr bir ifade taşıyor. Bir bilgi ...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 1.00.01.01.01.00.01.0 Metinde genel bir memnuniyetsizlik durum...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 2.02.02.01.02.02.02.0 Metinde genel olarak olumlu bir duygu ha...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.0 Metinde "sert" ve "acımas...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.02.02.02.02.0 Metinde hem olumlu hem de olumsuz duygul...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.04.04.04.04.03.04.0 Metin genel olarak olumlu bir duygu barı...
3 Sağlam bir uyarlama...... 5.0 3.04.02.05.05.03.04.0 Metin, "Sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 4.04.04.04.04.04.04.0 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.04.05.05.05.05.04.0 Metin genel olarak pozitif bir duygu dur...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.01.03.02.00.01.00.0 Metin oldukça eleştirel bir duygu barınd...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.10.00.50.00.20.20.5 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.81.21.22.51.21.21.2 The text conveys a sense of mild anxiety...
3 It was one of my most shameful experiences.... 3.0 2.12.52.52.82.22.22.5 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.91.21.20.00.20.81.2 The text expresses a mild to moderate le...
5 Then the screaming started.... 3.0 2.62.82.53.02.82.82.5 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.11.20.50.00.20.81.0 The text evokes a sense of peaceful acce...
7 You know what happens when I get one of these stupid ideas i... 1.0 1.91.21.22.81.81.41.2 The text expresses frustration and a hin...
8 My stomach even started giving me fits.... 2.0 1.82.11.22.81.21.81.2 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.10.50.50.00.20.20.5 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.81.21.22.81.21.21.2 The text evokes a sense of mild anxiety ...
3 It was one of my most shameful experiences.... 3.0 2.12.52.52.52.22.22.5 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.81.21.21.50.20.81.2 The text expresses a mild to moderate le...
5 Then the screaming started.... 3.0 2.62.82.83.02.82.82.5 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.11.20.50.50.20.81.2 The text evokes a sense of philosophical...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.11.81.82.81.81.41.2 The text expresses frustration and a hin...
8 My stomach even started giving me fits.... 2.0 1.81.81.22.81.21.81.2 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.10.51.22.00.20.20.5 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.82.10.22.01.21.41.0 The text evokes a mild sense of apprehen...
3 It was one of my most shameful experiences.... 3.0 2.12.62.52.82.22.22.2 The word "shameful" carries si...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.21.21.20.00.20.81.2 The text conveys a mild sense of resentm...
5 Then the screaming started.... 3.0 2.62.82.43.02.82.82.5 The text conveys a very high level of em...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.11.21.22.20.20.81.2 The text evokes a sense of quiet contemp...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.12.12.22.81.81.41.2 The text expresses frustration and a deg...
8 My stomach even started giving me fits.... 2.0 1.82.11.81.81.21.81.2 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 ----1.0-- {"score": 1, "explanation...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi katliamında,... 1.0 ----1.0-- {"score": 1, "explanation...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 1.91.81.91.51.91.81.8 Metin genel olarak oldukça olumlu bir du...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 1.91.92.01.51.91.91.9 Metin son derece olumlu bir duygu durumu...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.91.91.81.51.81.81.8 Metin genel olarak olumlu bir duygu ifad...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.51.21.01.0 Metin nötr bir bilgi cümlesidir. Bir keş...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 0.70.71.21.51.20.81.0 Metinde genel bir memnuniyetsizlik durum...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 1.81.81.81.51.81.81.8 Metin genel olarak olumlu bir duygu duru...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.40.01.50.00.30.1 Metin, "sert" ve "acıması...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 1.91.81.91.51.81.81.8 Metin genel olarak oldukça olumlu bir du...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 1.91.92.01.52.01.91.9 Metin oldukça pozitif bir duygu durumu i...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.91.91.81.51.81.81.8 Metin genel olarak olumlu bir duygu ifad...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.51.21.01.0 Metin nötr bir bilgi cümlesidir. Bir keş...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 0.70.71.21.51.20.81.0 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 1.81.81.81.51.81.81.8 Metin genel olarak olumlu bir duygu içer...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.40.01.50.00.30.1 Metin, "sert" ve "acıması...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 1.91.91.91.51.91.81.9 Metin genel olarak oldukça olumlu bir du...
2 ürünlerden çok memnunum, kesinlikle herkese tavsiye ederim. ... 2.0 1.91.92.01.52.01.92.0 Metin oldukça pozitif bir duygu durumu i...
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.91.91.91.41.81.81.8 Metin genel olarak olumlu bir duygu duru...
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.51.21.01.0 Metin, kesin bir bilgi veya keşif ifades...
5 bu telefonu başlıca alma nedenlerim ise elimdeki samsung j5... 2.0 0.70.40.81.51.20.81.2 Metinde belirgin bir olumlu veya olumsuz...
6 telefon gayet güzel ben kırmızı aldım baya şık görünüyor bu ... 2.0 1.81.81.91.51.81.81.8 Metin genel olarak pozitif bir duygu ifa...
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.40.20.00.00.30.5 Metin genel olarak olumsuz bir duygu ifa...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.52.52.52.21.51.5 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.03.54.24.54.22.54.0 Metin genel olarak olumlu bir duygu ifad...
3 Sağlam bir uyarlama...... 5.0 3.53.52.54.54.23.54.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 3.64.24.54.24.23.54.2 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.24.24.84.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.21.24.83.50.20.50.5 Metin genel olarak olumsuz bir duygu ifa...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.52.52.52.21.51.5 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.23.54.24.54.22.54.2 Metin genel olarak olumlu bir duygu ifad...
3 Sağlam bir uyarlama...... 5.0 3.53.53.54.54.23.54.0 Metin, "sağlam bir uyarlama..."...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 3.64.24.84.24.23.54.2 Metinde genel olarak olumlu bir duygu ha...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.24.24.84.54.84.54.2 Metin genel olarak oldukça olumlu bir du...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.21.24.84.00.20.50.5 Metin genel olarak olumsuz bir duygu ifa...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 2.02.02.22.51.51.52.0 Metinde hem olumlu ("oyunculuk güze...
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmıyor...... 3.0 4.03.54.54.54.22.54.2 Metin genel olarak olumlu bir duygu duru...
3 Sağlam bir uyarlama...... 5.0 3.53.54.53.04.23.54.5 Metin genel olarak olumlu bir havaya sah...
4 Yıllardır arşivimde beklerdi birkaç gün önce izledim ve nede... 5.0 3.64.24.54.54.23.53.7 Metin genel olarak pozitif bir duygu dur...
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye ediyorum.... 4.5 4.24.24.84.74.84.54.2 Metin genel olarak oldukça olumlu bir du...
6 Film demeye bile zorlanabileceğiniz bir aksiyon çukuru...10/... 2.0 1.21.25.04.00.20.50.5 Metin genel olarak olumsuz bir duygu ifa...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.00.00.02.00.00.00.0 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.02.02.02.02.01.01.0 The text evokes a mild sense of apprehen...
3 It was one of my most shameful experiences.... 3.0 2.03.03.03.03.03.03.0 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 2.01.02.00.01.00.01.0 The text expresses a clear sense of exas...
5 Then the screaming started.... 3.0 3.03.03.03.03.03.03.0 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.01.00.00.00.00.01.0 The text evokes a sense of quiet contemp...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.02.02.02.02.01.02.0 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 2.02.02.03.02.02.01.0 The phrase "giving me fits" in...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.00.00.00.00.00.00.0 The text evokes a sense of isolation or ...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.01.02.01.01.01.01.0 The text evokes a mild sense of apprehen...
3 It was one of my most shameful experiences.... 3.0 2.03.03.03.03.03.03.0 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 2.01.02.00.01.00.01.0 The text expresses a mild frustration an...
5 Then the screaming started.... 3.0 3.03.03.03.03.03.03.0 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.01.00.02.00.00.01.0 The text evokes a sense of quiet contemp...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.02.02.02.02.01.02.0 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 2.02.02.03.02.02.01.0 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.00.00.02.00.00.00.0 The text evokes a sense of isolation and...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.01.02.02.01.01.01.0 The text conveys a mild sense of apprehe...
3 It was one of my most shameful experiences.... 3.0 2.03.02.03.02.03.02.0 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 2.01.01.00.01.00.01.0 The text conveys a sense of mild frustra...
5 Then the screaming started.... 3.0 3.03.03.03.03.03.02.0 The single sentence, "Then the scre...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.02.00.01.00.00.01.0 The text evokes a sense of calm acceptan...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.02.02.02.02.01.02.0 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 2.02.02.02.02.02.01.0 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal ve eksiksiz... 2.0 ----2.0-- {"score": 2, "explanation...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 oyunculuk güzel ama sadece oyunculuk... konusunu hiç beğenme... 3.0 ----2.0-- {"score": 2, "explanation...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.10.50.52.00.20.20.5 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.81.21.21.51.21.21.2 The text evokes a sense of mild anxiety ...
3 It was one of my most shameful experiences.... 3.0 2.12.52.52.82.22.22.5 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.81.21.20.00.20.81.2 The text expresses a mild to moderate le...
5 Then the screaming started.... 3.0 2.62.82.53.02.82.82.5 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.11.20.50.00.20.81.2 The text evokes a sense of peaceful acce...
7 You know what happens when I get one of these stupid ideas i... 1.0 1.91.21.82.81.81.41.2 The text expresses frustration and a hin...
8 My stomach even started giving me fits.... 2.0 1.81.81.22.81.21.81.2 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.10.51.20.50.20.20.5 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.81.81.21.51.21.21.2 The text evokes a sense of mild anxiety ...
3 It was one of my most shameful experiences.... 3.0 2.12.52.22.82.22.22.5 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.81.21.20.00.20.81.2 The text expresses a mild to moderate le...
5 Then the screaming started.... 3.0 2.62.82.53.02.82.82.5 The text evokes a strong sense of fear a...
6 They don't fear death, and it seems they believe in reincarn... 1.0 0.81.20.50.00.20.81.2 The text evokes a sense of peaceful acce...
7 You know what happens when I get one of these stupid ideas i... 1.0 1.91.21.82.81.81.41.2 The text expresses frustration and a hin...
8 My stomach even started giving me fits.... 2.0 1.82.11.82.81.21.81.2 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.10.01.20.50.20.20.5 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.81.51.21.51.21.21.2 The text evokes a sense of mild anxiety ...
3 It was one of my most shameful experiences.... 3.0 2.22.62.42.82.22.22.5 The word "shameful" carries si...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.91.21.20.50.20.81.2 There's a discernible undercurrent of re...
5 Then the screaming started.... 3.0 2.62.82.23.02.82.82.5 The text conveys a very high level of em...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.11.20.51.00.20.81.2 The text evokes a sense of quiet contemp...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.11.21.82.81.81.41.2 The text expresses frustration and a hin...
8 My stomach even started giving me fits.... 2.0 1.81.82.02.81.21.81.2 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.00.00.00.00.00.00.0 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.02.02.01.01.01.01.0 The text evokes a mild sense of apprehen...
3 It was one of my most shameful experiences.... 3.0 2.03.03.03.02.03.03.0 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 2.01.02.00.01.00.01.0 The text expresses a clear sense of exas...
5 Then the screaming started.... 3.0 3.03.03.03.03.03.03.0 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.01.00.00.00.00.01.0 The text evokes a sense of peaceful acce...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.02.02.02.02.01.02.0 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 2.02.02.03.02.02.01.0 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.00.00.00.00.00.00.0 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.02.02.01.01.01.01.0 The text evokes a mild sense of apprehen...
3 It was one of my most shameful experiences.... 3.0 2.03.03.03.03.03.02.0 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 2.01.02.00.01.00.01.0 The text expresses a clear sense of **re...
5 Then the screaming started.... 3.0 3.03.03.03.03.03.03.0 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.01.00.02.00.00.01.0 The text evokes a sense of peaceful acce...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.02.02.02.02.01.02.0 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 2.02.02.03.02.02.01.0 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.00.02.02.00.00.00.0 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.02.02.01.02.01.01.0 The text evokes a mild sense of apprehen...
3 It was one of my most shameful experiences.... 3.0 2.03.03.03.02.03.02.0 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.01.03.02.01.00.01.0 The text expresses a mild sense of annoy...
5 Then the screaming started.... 3.0 3.03.03.03.03.03.03.0 The word "screaming" immediate...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.01.00.00.00.00.01.0 The text evokes a sense of peaceful acce...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.02.02.03.02.01.02.0 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 2.02.03.03.02.02.01.0 The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.10.50.22.00.20.20.5 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.81.21.22.51.21.21.2 The text evokes a sense of mild anxiety ...
3 It was one of my most shameful experiences.... 3.0 2.12.52.52.82.22.22.5 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.81.21.20.00.20.81.2 The text expresses a mild sense of resen...
5 Then the screaming started.... 3.0 2.62.82.53.02.82.82.5 The single sentence, “Then the screaming...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.11.20.50.00.20.81.2 The text evokes a sense of calm acceptan...
7 You know what happens when I get one of these stupid ideas i... 1.0 1.91.21.82.51.81.41.2 The text expresses frustration and a hin...
8 My stomach even started giving me fits.... 2.0 1.8-1.22.81.21.8- The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.10.50.50.00.20.20.5 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.81.21.21.51.21.41.2 The text evokes a sense of mild anxiety ...
3 It was one of my most shameful experiences.... 3.0 2.22.52.52.82.22.22.5 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.81.21.20.00.20.81.2 The text conveys a mild sense of exasper...
5 Then the screaming started.... 3.0 2.62.82.83.02.82.82.5 The single sentence, "Then the scre...
6 They don't fear death, and it seems they believe in reincarn... 1.0 1.11.20.50.50.20.81.2 The text evokes a sense of peaceful acce...
7 You know what happens when I get one of these stupid ideas i... 1.0 1.91.82.22.81.81.41.2 The text expresses frustration and a hin...
8 My stomach even started giving me fits.... 2.0 1.9-1.22.81.2-- The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 1.10.50.21.00.20.20.5 The phrase "middle of nowhere"...
2 This involved swimming a pretty large lake that was over my ... 2.0 1.81.22.21.51.21.21.0 The text evokes a sense of unease and po...
3 It was one of my most shameful experiences.... 3.0 2.12.52.51.52.22.22.5 The word "shameful" carries a ...
4 After all, I had vegetables coming out my ears all for the b... 0.0 1.91.21.12.50.80.81.1 The text expresses a mild to moderate le...
5 Then the screaming started.... 3.0 2.62.82.83.02.82.82.5 The text conveys a high degree of negati...
6 They don't fear death, and it seems they believe in reincarn... 1.0 0.81.20.51.50.20.81.2 The text evokes a sense of quiet contemp...
7 You know what happens when I get one of these stupid ideas i... 1.0 2.12.22.12.81.81.41.2 The text expresses frustration and annoy...
8 My stomach even started giving me fits.... 2.0 1.8-2.21.81.2-- The phrase "stomach giving me fits&...
# Text GT gemma-3-27bhermes-3-405llama-3.1-8bllama-3.2-3bllama-3.3-70mistral-smalgpt-4o-mini LLM Response
1 Colorado, middle of nowhere.... 1.0 ----1.0-- {"score": 1, "explanation...

🔬 Model × Dataset Detailed Comparison

Her model-dataset çifti için tüm kombinasyonlar (Strategy × Temperature × Scoring Mode). Satıra tıklayarak detayları görün.

No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 Colorado, middle of nowhere.... 1.0 1.11.01.11.01.11.01.11.01.11.01.11.01.11.01.11.01.11.0
2 This involved swimming a pretty large lake that wa... 2.0 1.81.01.81.01.81.01.81.01.81.01.81.01.81.01.81.01.81.0
3 It was one of my most shameful experiences.... 3.0 2.12.02.12.02.22.02.12.02.22.02.12.02.12.02.12.02.12.0
4 After all, I had vegetables coming out my ears all... 0.0 1.82.01.82.01.92.01.82.01.82.01.91.01.92.01.82.01.22.0
5 Then the screaming started.... 3.0 2.63.02.63.02.63.02.63.02.63.02.63.02.63.02.63.02.63.0
6 They don't fear death, and it seems they believe i... 1.0 1.11.00.81.01.11.01.11.01.11.00.81.01.11.01.11.01.11.0
7 You know what happens when I get one of these stup... 1.0 1.92.01.92.02.12.01.92.01.92.02.12.01.92.02.12.02.12.0
8 My stomach even started giving me fits.... 2.0 1.82.01.82.01.82.01.82.01.92.01.82.01.82.01.82.01.82.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 I believe that technology has improved our lives s... 4.0 2.54.02.54.02.54.02.12.02.12.02.12.02.53.02.53.02.93.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 oyunculuk güzel ama sadece oyunculuk... konusunu h... 3.0 2.02.02.02.02.02.02.02.02.02.02.02.02.02.02.02.02.02.0
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmı... 3.0 4.04.04.24.04.24.04.04.04.24.04.04.04.04.04.04.04.24.0
3 Sağlam bir uyarlama...... 5.0 3.53.03.53.03.53.03.53.03.53.03.53.03.53.03.54.03.53.0
4 Yıllardır arşivimde beklerdi birkaç gün önce izled... 5.0 3.64.03.64.03.64.03.64.03.64.03.64.03.64.03.64.03.64.0
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye... 4.5 4.24.04.24.04.24.04.24.04.24.04.24.04.24.04.24.04.24.0
6 Film demeye bile zorlanabileceğiniz bir aksiyon çu... 2.0 1.21.01.21.01.21.01.21.01.21.01.21.01.21.01.21.01.51.0
7 güzel aşk filmlerinden biri.. unutulamayan aşklara... 4.0 -4.0-4.0-4.0------4.24.04.24.04.24.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi ka... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
2 Bugün yine floodluk bissürü çorbalar çıktı da yaza... 0.0 1.01.01.01.01.00.01.01.01.01.01.01.01.01.01.01.01.01.0
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek n... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu co... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözün... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlari... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KAS... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal v... 2.0 1.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.0
2 ürünlerden çok memnunum, kesinlikle herkese tavsiy... 2.0 1.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.0
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.0
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 bu telefonu başlıca alma nedenlerim ise elimdeki s... 2.0 0.71.00.71.00.71.00.71.00.71.00.71.00.71.00.71.00.71.0
6 telefon gayet güzel ben kırmızı aldım baya şık gör... 2.0 1.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.0
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.00.10.00.10.00.10.00.10.00.10.00.10.00.20.00.10.0
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 -2.0-2.0-2.0------1.92.01.92.01.92.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 Colorado, middle of nowhere.... 1.0 0.50.00.50.00.00.00.50.00.50.00.50.00.00.00.50.00.50.0
2 This involved swimming a pretty large lake that wa... 2.0 1.22.01.81.01.51.01.22.01.22.01.22.01.22.01.22.02.12.0
3 It was one of my most shameful experiences.... 3.0 2.53.02.53.02.63.02.53.02.53.02.53.02.53.02.53.02.63.0
4 After all, I had vegetables coming out my ears all... 0.0 1.21.01.21.01.21.01.21.01.21.01.21.01.21.01.21.01.21.0
5 Then the screaming started.... 3.0 2.83.02.83.02.83.02.83.02.83.02.83.02.83.02.83.02.83.0
6 They don't fear death, and it seems they believe i... 1.0 1.21.01.21.01.22.01.21.01.21.01.21.01.21.01.21.01.21.0
7 You know what happens when I get one of these stup... 1.0 1.22.01.22.01.22.01.22.01.82.02.22.01.22.01.82.02.12.0
8 My stomach even started giving me fits.... 2.0 1.82.02.12.01.82.0-2.0-2.0-2.02.12.01.82.02.12.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 I believe that technology has improved our lives s... 4.0 4.54.04.54.04.54.02.52.02.52.02.02.04.54.04.54.04.54.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 oyunculuk güzel ama sadece oyunculuk... konusunu h... 3.0 2.52.02.52.02.52.02.52.02.52.02.02.02.52.02.52.02.52.0
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmı... 3.0 3.54.03.54.03.54.03.54.03.54.03.54.03.54.03.54.03.54.0
3 Sağlam bir uyarlama...... 5.0 3.54.03.54.03.54.03.54.03.54.03.54.03.54.03.54.03.54.0
4 Yıllardır arşivimde beklerdi birkaç gün önce izled... 5.0 4.24.04.24.04.24.04.24.04.24.04.24.04.24.04.24.03.84.0
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye... 4.5 4.24.04.24.03.84.04.24.04.24.04.24.04.24.04.24.04.24.0
6 Film demeye bile zorlanabileceğiniz bir aksiyon çu... 2.0 1.21.01.21.01.21.01.21.01.21.01.21.01.21.01.21.01.20.0
7 güzel aşk filmlerinden biri.. unutulamayan aşklara... 4.0 -4.0-4.0-4.0------4.24.04.25.04.54.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi ka... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
2 Bugün yine floodluk bissürü çorbalar çıktı da yaza... 0.0 0.00.00.00.00.00.00.00.00.00.00.01.00.00.00.00.01.00.0
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek n... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu co... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözün... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlari... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KAS... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal v... 2.0 1.92.01.92.01.82.01.82.01.82.01.92.01.82.01.82.01.82.0
2 ürünlerden çok memnunum, kesinlikle herkese tavsiy... 2.0 1.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.0
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.0
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 bu telefonu başlıca alma nedenlerim ise elimdeki s... 2.0 0.70.00.70.00.40.00.70.00.70.00.40.00.40.00.70.00.40.0
6 telefon gayet güzel ben kırmızı aldım baya şık gör... 2.0 1.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.0
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.40.00.40.00.20.00.40.00.40.00.40.00.40.00.40.00.20.0
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 -2.0-2.0-2.0------1.82.01.82.01.82.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 Colorado, middle of nowhere.... 1.0 0.50.01.20.01.20.00.20.00.50.00.22.00.50.00.50.01.20.0
2 This involved swimming a pretty large lake that wa... 2.0 1.22.01.22.01.22.01.22.01.22.02.22.01.22.01.22.00.22.0
3 It was one of my most shameful experiences.... 3.0 2.53.02.23.02.42.02.53.02.53.02.53.02.53.02.53.02.52.0
4 After all, I had vegetables coming out my ears all... 0.0 1.22.01.22.01.21.01.22.01.22.01.13.01.22.01.21.01.22.0
5 Then the screaming started.... 3.0 2.53.02.53.02.23.02.53.02.83.02.83.02.53.02.83.02.43.0
6 They don't fear death, and it seems they believe i... 1.0 0.50.00.50.00.50.00.50.00.50.00.50.00.50.00.50.01.20.0
7 You know what happens when I get one of these stup... 1.0 1.82.01.82.01.82.01.82.02.22.02.12.01.22.01.82.02.22.0
8 My stomach even started giving me fits.... 2.0 1.22.01.82.02.02.01.22.01.22.02.23.01.22.01.22.01.82.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 I believe that technology has improved our lives s... 4.0 4.24.04.25.02.34.02.52.02.52.02.52.04.55.04.25.02.55.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 oyunculuk güzel ama sadece oyunculuk... konusunu h... 3.0 2.02.01.82.02.52.02.52.02.52.02.22.01.52.02.52.02.02.0
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmı... 3.0 4.24.04.24.03.44.04.24.04.24.04.54.04.24.04.24.04.24.0
3 Sağlam bir uyarlama...... 5.0 2.05.02.55.04.55.02.55.03.55.04.52.02.55.03.05.03.55.0
4 Yıllardır arşivimde beklerdi birkaç gün önce izled... 5.0 4.85.04.85.04.84.04.55.04.85.04.54.04.85.04.85.04.55.0
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye... 4.5 4.85.04.85.04.05.04.85.04.85.04.85.04.55.04.85.04.75.0
6 Film demeye bile zorlanabileceğiniz bir aksiyon çu... 2.0 4.85.05.05.01.25.04.85.04.85.05.03.04.85.04.85.00.55.0
7 güzel aşk filmlerinden biri.. unutulamayan aşklara... 4.0 4.55.04.55.0-5.0------4.55.04.25.04.74.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi ka... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
2 Bugün yine floodluk bissürü çorbalar çıktı da yaza... 0.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek n... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu co... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözün... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlari... 0.0 1.01.01.01.00.01.01.01.00.01.01.00.01.01.01.01.00.01.0
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KAS... 0.0 1.01.01.01.00.01.01.01.01.01.00.01.01.01.01.01.00.01.0
10 @smoketilimhigh3 @highenoughtodi Böyle ironiyi de ... 1.0 -------------1.0----
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal v... 2.0 1.92.01.92.02.02.01.92.01.92.01.92.01.92.01.92.01.92.0
2 ürünlerden çok memnunum, kesinlikle herkese tavsiy... 2.0 2.02.01.92.02.02.02.02.02.02.02.02.02.02.02.02.01.92.0
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.82.01.92.02.02.01.82.01.82.01.92.01.82.01.52.01.82.0
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.02.01.00.01.02.01.01.01.00.01.02.01.02.01.02.01.01.0
5 bu telefonu başlıca alma nedenlerim ise elimdeki s... 2.0 1.21.01.11.01.21.01.21.01.21.00.81.01.21.01.41.01.20.0
6 telefon gayet güzel ben kırmızı aldım baya şık gör... 2.0 1.82.01.82.01.82.01.82.01.82.01.92.01.82.01.82.01.62.0
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.00.00.00.00.00.20.00.00.00.00.00.00.0
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 -2.0-2.0-2.0------1.82.01.92.02.02.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 Colorado, middle of nowhere.... 1.0 2.02.00.50.00.52.02.00.00.00.01.02.00.00.00.00.02.00.0
2 This involved swimming a pretty large lake that wa... 2.0 1.52.01.51.01.52.02.51.01.51.01.51.02.52.02.81.02.02.0
3 It was one of my most shameful experiences.... 3.0 2.83.02.83.02.83.02.83.02.83.01.53.02.83.02.53.02.83.0
4 After all, I had vegetables coming out my ears all... 0.0 0.00.00.00.00.50.00.00.00.00.02.52.00.00.01.50.00.00.0
5 Then the screaming started.... 3.0 3.03.03.03.03.03.03.03.03.03.03.03.03.03.03.03.03.03.0
6 They don't fear death, and it seems they believe i... 1.0 0.00.00.02.01.01.00.00.00.52.01.50.00.00.00.50.02.22.0
7 You know what happens when I get one of these stup... 1.0 2.82.02.82.02.82.02.52.02.82.02.83.02.82.02.82.02.83.0
8 My stomach even started giving me fits.... 2.0 2.83.02.83.02.82.02.83.02.83.01.83.02.83.02.83.01.82.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 I believe that technology has improved our lives s... 4.0 4.24.04.24.04.24.02.52.02.53.04.54.04.54.04.54.04.55.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 oyunculuk güzel ama sadece oyunculuk... konusunu h... 3.0 2.52.01.52.03.22.02.52.02.52.02.52.02.52.02.52.00.32.0
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmı... 3.0 4.54.04.54.04.55.04.54.04.54.04.54.04.54.04.55.04.84.0
3 Sağlam bir uyarlama...... 5.0 4.55.04.55.03.05.04.55.04.55.03.05.04.55.04.55.04.53.0
4 Yıllardır arşivimde beklerdi birkaç gün önce izled... 5.0 4.54.04.54.04.53.04.24.04.24.04.54.04.54.04.54.03.84.0
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye... 4.5 4.55.04.55.02.55.04.55.04.55.04.75.04.55.04.55.04.85.0
6 Film demeye bile zorlanabileceğiniz bir aksiyon çu... 2.0 3.53.01.53.03.23.03.53.04.03.04.02.01.53.02.53.01.83.0
7 güzel aşk filmlerinden biri.. unutulamayan aşklara... 4.0 4.54.0-4.0-4.0------4.54.04.54.04.54.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi ka... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
2 Bugün yine floodluk bissürü çorbalar çıktı da yaza... 0.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek n... 0.0 1.01.00.01.00.01.01.01.01.01.00.01.01.00.00.00.01.00.0
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu co... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözün... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlari... 0.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KAS... 0.0 1.01.01.01.01.01.01.01.01.01.0-1.01.01.01.01.01.01.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal v... 2.0 1.51.01.51.01.51.01.51.01.52.01.51.01.52.01.52.01.42.0
2 ürünlerden çok memnunum, kesinlikle herkese tavsiy... 2.0 1.52.01.52.01.52.01.52.01.52.01.52.01.52.01.52.01.52.0
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.52.01.51.01.21.01.52.01.52.01.42.01.52.01.52.01.52.0
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.51.01.51.00.01.01.51.01.51.01.50.01.51.01.51.01.52.0
5 bu telefonu başlıca alma nedenlerim ise elimdeki s... 2.0 1.51.01.51.01.20.01.51.01.51.01.51.01.51.01.51.01.01.0
6 telefon gayet güzel ben kırmızı aldım baya şık gör... 2.0 1.51.01.51.01.51.01.51.01.51.01.51.01.51.01.51.01.91.0
7 Sert ve acımasız özelliklere sahiptir .... 1.0 1.50.00.00.01.20.01.50.01.50.00.00.00.00.01.50.01.50.0
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 -2.0-2.0-2.0------1.52.01.52.01.02.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°rag_ru 0.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 Colorado, middle of nowhere.... 1.0 0.20.00.20.00.20.01.00.20.00.20.00.20.00.20.00.20.00.20.0
2 This involved swimming a pretty large lake that wa... 2.0 1.22.01.21.01.21.0-1.21.01.21.01.22.01.21.01.21.01.22.0
3 It was one of my most shameful experiences.... 3.0 2.23.02.23.02.22.0-2.22.02.23.02.22.02.23.02.22.02.22.0
4 After all, I had vegetables coming out my ears all... 0.0 0.21.00.21.00.21.0-0.21.00.21.00.81.00.21.00.21.00.21.0
5 Then the screaming started.... 3.0 2.83.02.83.02.83.0-2.83.02.83.02.83.02.83.02.83.02.83.0
6 They don't fear death, and it seems they believe i... 1.0 0.20.00.20.00.20.0-0.20.00.20.00.20.00.20.00.20.00.20.0
7 You know what happens when I get one of these stup... 1.0 1.82.01.82.01.82.0-1.82.01.82.01.82.01.82.01.82.01.82.0
8 My stomach even started giving me fits.... 2.0 1.22.01.22.01.22.0-1.22.01.22.01.22.01.22.01.22.01.22.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°rag_ru 0.0°rag_ru 0.2°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 I believe that technology has improved our lives s... 4.0 2.54.02.54.02.14.02.02.02.12.02.12.02.12.02.54.02.54.02.54.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°rag_ru 0.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 oyunculuk güzel ama sadece oyunculuk... konusunu h... 3.0 2.22.02.22.02.22.02.02.22.02.22.01.52.02.22.02.22.01.52.0
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmı... 3.0 4.24.04.24.04.24.0-4.24.04.25.04.24.04.24.04.24.04.24.0
3 Sağlam bir uyarlama...... 5.0 4.24.04.25.04.25.0-4.24.04.24.04.25.04.25.04.25.04.25.0
4 Yıllardır arşivimde beklerdi birkaç gün önce izled... 5.0 4.24.04.24.04.24.0-4.24.04.24.04.24.04.24.04.25.04.24.0
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye... 4.5 4.85.04.85.04.85.0-4.85.04.85.04.85.04.85.04.85.04.85.0
6 Film demeye bile zorlanabileceğiniz bir aksiyon çu... 2.0 0.20.00.20.00.20.0-0.20.00.20.00.20.00.20.00.20.00.20.0
7 güzel aşk filmlerinden biri.. unutulamayan aşklara... 4.0 4.85.04.85.04.85.0-------4.85.04.85.04.85.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°rag_ru 0.0°rag_ru 0.2°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi ka... 1.0 0.00.00.00.00.00.01.01.00.00.00.00.00.00.00.00.00.00.00.00.0
2 Bugün yine floodluk bissürü çorbalar çıktı da yaza... 0.0 0.00.00.00.00.00.0--0.00.00.00.00.01.00.00.01.00.00.00.0
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek n... 0.0 0.00.00.00.00.00.0--0.00.00.00.00.00.00.00.00.00.00.00.0
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu co... 1.0 1.01.01.01.01.01.0--1.01.01.01.01.01.01.01.01.01.01.01.0
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ... 1.0 1.01.01.01.01.01.0--1.01.01.01.01.01.01.01.01.01.01.01.0
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözün... 1.0 0.00.00.00.00.00.0--0.00.00.00.00.00.00.00.00.00.00.00.0
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlari... 0.0 0.00.00.00.00.00.0--0.00.00.00.01.00.00.00.00.00.00.00.0
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.0--1.01.01.01.01.01.01.01.01.01.01.01.0
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KAS... 0.0 0.00.00.00.00.00.0--0.00.00.00.00.00.00.00.00.00.00.00.0
10 @smoketilimhigh3 @highenoughtodi Böyle ironiyi de ... 1.0 ---------------1.0-1.0-1.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°rag_ru 0.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal v... 2.0 1.82.01.82.01.92.02.01.82.01.92.01.92.01.92.01.82.01.82.0
2 ürünlerden çok memnunum, kesinlikle herkese tavsiy... 2.0 1.92.01.92.02.02.0-2.02.01.92.02.02.01.92.02.02.02.02.0
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.82.01.82.01.82.0-1.82.01.82.01.82.01.82.01.82.01.82.0
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.21.01.21.01.21.0-1.21.01.21.01.21.01.21.01.21.01.21.0
5 bu telefonu başlıca alma nedenlerim ise elimdeki s... 2.0 1.21.01.21.01.21.0-1.21.01.21.01.21.01.21.01.21.01.20.0
6 telefon gayet güzel ben kırmızı aldım baya şık gör... 2.0 1.82.01.82.01.82.0-1.82.01.82.01.82.01.82.01.82.01.82.0
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.00.00.00.00.00.0-0.00.00.00.00.00.00.00.00.00.00.00.0
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 -2.0-2.0-2.0-------1.82.01.82.01.82.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 Colorado, middle of nowhere.... 1.0 0.20.00.20.00.20.00.20.00.20.00.20.00.20.00.20.00.20.0
2 This involved swimming a pretty large lake that wa... 2.0 1.21.01.21.01.21.01.21.01.41.01.21.01.21.01.21.01.41.0
3 It was one of my most shameful experiences.... 3.0 2.23.02.23.02.23.02.23.02.23.02.23.02.23.02.23.02.23.0
4 After all, I had vegetables coming out my ears all... 0.0 0.80.00.80.00.80.00.80.00.80.00.80.00.80.00.80.00.80.0
5 Then the screaming started.... 3.0 2.83.02.83.02.83.02.83.02.83.02.83.02.83.02.83.02.83.0
6 They don't fear death, and it seems they believe i... 1.0 0.80.00.80.00.80.00.80.00.80.00.80.00.80.00.80.00.80.0
7 You know what happens when I get one of these stup... 1.0 1.41.01.41.01.41.01.41.01.41.01.41.01.41.01.41.01.41.0
8 My stomach even started giving me fits.... 2.0 1.82.01.82.01.82.01.82.0-2.0-2.01.82.01.82.01.82.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 I believe that technology has improved our lives s... 4.0 2.53.02.53.02.53.02.22.02.22.02.22.02.53.02.53.02.53.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 oyunculuk güzel ama sadece oyunculuk... konusunu h... 3.0 1.52.01.52.01.52.01.52.01.52.01.52.01.52.01.52.01.52.0
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmı... 3.0 2.53.02.53.02.53.02.53.02.53.02.53.02.53.02.53.02.53.0
3 Sağlam bir uyarlama...... 5.0 3.53.03.53.03.53.03.53.03.53.03.53.03.53.03.53.03.53.0
4 Yıllardır arşivimde beklerdi birkaç gün önce izled... 5.0 3.54.03.54.03.54.03.54.03.54.03.54.03.54.03.54.03.54.0
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye... 4.5 4.55.04.55.04.55.04.55.04.55.04.55.04.55.04.55.04.55.0
6 Film demeye bile zorlanabileceğiniz bir aksiyon çu... 2.0 0.51.00.51.00.51.00.51.00.51.00.51.00.51.00.51.00.51.0
7 güzel aşk filmlerinden biri.. unutulamayan aşklara... 4.0 -5.0-5.0-5.0------4.55.04.55.04.55.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi ka... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
2 Bugün yine floodluk bissürü çorbalar çıktı da yaza... 0.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek n... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu co... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözün... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlari... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KAS... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal v... 2.0 1.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.0
2 ürünlerden çok memnunum, kesinlikle herkese tavsiy... 2.0 1.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.01.92.0
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.0
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 bu telefonu başlıca alma nedenlerim ise elimdeki s... 2.0 0.80.00.80.00.80.00.80.00.80.00.80.00.80.00.80.00.80.0
6 telefon gayet güzel ben kırmızı aldım baya şık gör... 2.0 1.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.0
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.30.00.30.00.30.00.30.00.30.00.30.00.30.00.30.00.30.0
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 -2.0-2.0-2.0------1.92.01.92.01.92.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 Colorado, middle of nowhere.... 1.0 0.50.00.50.00.50.00.50.00.50.00.50.00.50.00.50.00.51.0
2 This involved swimming a pretty large lake that wa... 2.0 1.21.01.21.01.21.01.21.01.21.01.01.01.21.01.21.01.02.0
3 It was one of my most shameful experiences.... 3.0 2.53.02.53.02.52.02.53.02.52.02.52.02.53.02.53.02.23.0
4 After all, I had vegetables coming out my ears all... 0.0 1.21.01.21.01.21.01.21.01.21.01.11.01.21.01.21.01.21.0
5 Then the screaming started.... 3.0 2.53.02.53.02.52.02.53.02.53.02.53.02.53.02.53.02.53.0
6 They don't fear death, and it seems they believe i... 1.0 1.21.01.21.01.21.01.21.01.21.01.21.01.01.01.21.01.21.0
7 You know what happens when I get one of these stup... 1.0 1.22.01.22.01.22.01.22.01.22.01.22.01.22.01.22.01.22.0
8 My stomach even started giving me fits.... 2.0 1.21.01.21.01.21.0-1.0-1.0-1.01.21.01.21.01.21.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 I believe that technology has improved our lives s... 4.0 4.04.04.04.04.04.02.02.02.02.02.52.03.53.03.53.03.03.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 oyunculuk güzel ama sadece oyunculuk... konusunu h... 3.0 1.52.01.52.02.01.01.52.01.52.02.02.02.02.02.02.02.02.0
2 Basit bir senaryo,güzel oyunculuk,güzel film sıkmı... 3.0 4.04.04.04.04.24.04.04.04.24.04.24.04.04.04.04.04.04.0
3 Sağlam bir uyarlama...... 5.0 4.04.04.04.04.04.04.04.04.04.04.54.04.04.04.04.04.04.0
4 Yıllardır arşivimde beklerdi birkaç gün önce izled... 5.0 4.24.04.24.04.24.04.24.04.24.03.74.04.24.04.24.04.24.0
5 Ben filmi beğendim,özellikle 3d izlemenizi tavsiye... 4.5 4.24.04.24.04.25.04.24.04.24.04.24.04.24.04.24.04.25.0
6 Film demeye bile zorlanabileceğiniz bir aksiyon çu... 2.0 1.20.01.20.00.20.00.50.00.50.00.50.00.50.00.50.00.10.0
7 güzel aşk filmlerinden biri.. unutulamayan aşklara... 4.0 -5.0-5.0-5.0------4.25.04.25.04.25.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 29 yıl önce Hocalı'da Ermeni işgalcilerin vahşi ka... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
2 Bugün yine floodluk bissürü çorbalar çıktı da yaza... 0.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
3 @USER 🍓🍋 ReisTarıma AtamayıAçıklıyor 🍒🍇 Gelecek n... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
4 🌆tokyo revengers spoiler glb🌆 . . . . . . ya bu co... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 Siyaset ölsün artıkın bıktık.Saltanat geri gelsin ... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
6 @USER Boş duran,boş işlerle uğraşan daralır .Gözün... 1.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
7 @USER Umarım.. Sorsam Eylem'i sevmeyecek insanlari... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
8 @322neco amk ergeni ne anlatıyon... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
9 ÜZERİME GELİNLİK GİYİP , KAFAMA'DA MOTOSİKLET KAS... 0.0 0.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.00.0
No Input Text Actual Few 0.0°Few 0.0°Few 0.2°Few 0.2°Few 1.0°Few 1.0°Rubric 0.0°Rubric 0.0°Rubric 0.2°Rubric 0.2°Rubric 1.0°Rubric 1.0°Zero 0.0°Zero 0.0°Zero 0.2°Zero 0.2°Zero 1.0°Zero 1.0°
1 ürünü hepsiburadadan alalı 3 hafta oldu. orjinal v... 2.0 1.82.01.82.01.82.01.82.01.82.01.92.01.82.01.82.01.92.0
2 ürünlerden çok memnunum, kesinlikle herkese tavsiy... 2.0 2.02.01.92.01.92.01.92.01.92.02.02.01.92.02.02.01.82.0
3 hızlı kargo, temiz alışveriş.teşekkür ederim.... 2.0 1.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.0
4 Çünkü aranan tapınak bu bölgededir .... 1.0 1.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.01.0
5 bu telefonu başlıca alma nedenlerim ise elimdeki s... 2.0 1.01.01.01.00.81.01.01.01.01.01.21.01.01.01.01.01.01.0
6 telefon gayet güzel ben kırmızı aldım baya şık gör... 2.0 1.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.01.82.0
7 Sert ve acımasız özelliklere sahiptir .... 1.0 0.10.00.10.00.30.00.10.00.10.00.50.00.10.00.10.00.10.0
8 harika bir kokusu var ve tüm odayı sarıyor..... 2.0 -2.0-2.0-2.0------1.82.01.82.01.82.0

📝 Metrik Tanımları

MAE Mean Absolute Error (Ortalama Mutlak Hata)

LLM'in verdiği puanların gerçek puanlardan ne kadar saptığını ölçer. Her bir tahmin için hatanın mutlak değeri alınır ve ortalaması hesaplanır.
MAE = (1/n) × Σ |ŷᵢ − yᵢ|
Neden önemli: En sezgisel hata metriğidir. MAE = 0.5 demek, model ortalamada yarım puan hata yapıyor demektir. Ölçek bağımlıdır - 1-5 ölçeğinde 0.5 iyidir, 1-100 ölçeğinde çok iyidir.
📏 Aralık: 0 → ∞
↓ Düşük = İyi
✓ 1-5 ölçeğinde: MAE < 0.5 mükemmel, 0.5-1.0 iyi, > 1.0 geliştirilebilir

NMAE Normalized MAE (Normalize Edilmiş MAE)

MAE'yi puan aralığına bölerek 0-1 arasına normalize eder. Farklı ölçeklerdeki datasetleri karşılaştırmak için kullanılır.
NMAE = MAE / (max − min)
Neden önemli: 1-10 ölçeğinde MAE=0.9 ile 1-5 ölçeğinde MAE=0.36 aynı NMAE'ye sahiptir (0.1). Bu sayede farklı datasetler arasında adil karşılaştırma yapılabilir.
📏 Aralık: 0 → 1
↓ Düşük = İyi
✓ NMAE < 0.1 mükemmel, 0.1-0.2 iyi, > 0.2 geliştirilebilir

QWK Quadratic Weighted Kappa (Kuadratik Ağırlıklı Kappa)

İki değerlendirici arasındaki uyumu ölçer. Tesadüfi uyumu hesaba katar ve büyük farklara daha yüksek ceza verir.
κ = 1 − [Σ wᵢⱼ × Oᵢⱼ] / [Σ wᵢⱼ × Eᵢⱼ]
Neden önemli: Essay puanlama yarışmalarının (Kaggle ASAP) standart metriğidir. 1 puan hata ile 3 puan hatayı farklı ağırlıklandırır - büyük hatalar kuadratik olarak cezalandırılır.
📏 Aralık: −1 → +1
↑ Yüksek = İyi
<0.20 Zayıf 0.20-0.40 Orta 0.60-0.80 Çok İyi >0.80 Mükemmel

r Pearson Korelasyonu

Tahmin ve gerçek değerler arasındaki doğrusal ilişkinin gücünü ve yönünü ölçer. Birlikte artıp azalıyorlar mı?
r = Σ(x−x̄)(y−ȳ) / √[Σ(x−x̄)² × Σ(y−ȳ)²]
Dikkat: Yüksek korelasyon tek başına yeterli değil! Model tutarlı şekilde +2 yüksek puanlasa bile r yüksek olabilir. Bu yüzden Bias metriği de kontrol edilmeli.
📏 Aralık: −1 → +1
↑ +1'e yakın = İyi
r ≈ 1: Güçlü pozitif | r ≈ 0: İlişki yok | r ≈ −1: Ters ilişki

ρ Spearman Korelasyonu (Rank Korelasyonu)

Sıralama bazlı korelasyon. Değerlerin kendisi değil, sıralamasının tutarlılığını ölçer.
ρ = 1 − [6 × Σ dᵢ²] / [n(n² − 1)]
Pearson'dan farkı: Model 1-2-3-4 yerine 2-4-6-8 verse bile Spearman = 1 olur (sıralama aynı). Pearson ise düşük çıkar. Monoton ilişkileri yakalamada daha iyidir.
📏 Aralık: −1 → +1
↑ +1'e yakın = İyi
Sıralama tutarlılığını ölçer - hangisi daha iyi/kötü sorusuna cevap verir

Bias Yanlılık (Sistematik Hata)

Modelin ortalamada gerçekten yüksek mi yoksa düşük mü puan verdiğini gösterir.
Bias = ȳ_tahmin − ȳ_gerçek
Yorumlama: Bias = +0.8 ise model ortalama 0.8 puan fazla veriyor (cömert/lenient). Bias = −0.5 ise 0.5 puan eksik veriyor (sert/strict). İdeal değer 0'dır.
📏 Aralık: −∞ → +∞
≈ 0 = İyi
>0 Cömert (Lenient) ≈0 Dengeli <0 Sert (Strict)

ECE Expected Calibration Error (Kalibrasyon Hatası)

Modelin "eminlik" seviyesi ile gerçek doğruluk oranı arasındaki farkı ölçer. İyi kalibre edilmiş model, emin olduğunda doğru olmalı.
ECE = Σ (|Bₘ|/n) × |acc(Bₘ) − conf(Bₘ)|
Neden önemli: Model "%90 eminim" dediğinde gerçekten %90 doğru mu? Overconfident modeller yüksek güven verip hata yapar. Bu metrik bunu ölçer.
📏 Aralık: 0 → 1
↓ Düşük = İyi
<0.10 İyi ✓ 0.10-0.20 Orta >0.20 Kötü
📚 Guo et al. (2017) "On Calibration of Modern Neural Networks" - NeurIPS

Brier Brier Score (Olasılıksal Doğruluk)

Normalize edilmiş tahmin hatalarının karelerinin ortalaması. Hem doğruluğu hem de güveni tek metrikte birleştirir.
Brier = (1/n) × Σ (ŷ_norm − y_norm)²
Avantajı: MAE'den farklı olarak büyük hatalara daha fazla ceza verir (karesel). NMAE ile birlikte yorumlanmalıdır.
📏 Aralık: 0 → 1
↓ Düşük = İyi
<0.05 Mükemmel ✓ 0.05-0.10 İyi >0.10 Orta
📚 Brier (1950) "Verification of forecasts expressed in terms of probability"

Generated by LLM Benchmark Suite | 2025-12-31T13:18:05.353178